Abstract. We prove an asymptotic estimate for the number of $m \times n$ non-negative
integer matrices (contingency tables) with prescribed row and column sums and,
more generally, for the number of integer feasible flows in a network. Similarly,
we estimate the volume of the polytope of $m \times n$ non-negative real matrices with
prescribed row and column sums. Our estimates are solutions of convex optimization
problems and hence can be computed efficiently. As a corollary, we show that if row
sums $R = (r_1, \ldots, r_m)$ and column sums $C = (c_1, \ldots, c_n)$ with $r_1 + \ldots + r_m =
c_1 + \ldots + c_n = N$ are sufficiently far from constant vectors, then, asymptotically, in
the uniform probability space of the $m \times n$ non-negative integer matrices with the
total sum $N$ of entries, the event consisting of the matrices with row sums $R$ and the
event consisting of the matrices with column sums $C$ are positively correlated.



1. Introduction and main results 

Let $m > 1$ and $n > 1$ be integers and let $R = (r_1, \ldots, r_m)$ and $C = (c_1, \ldots, c_n)$
be positive integer vectors such that

\[ r_1 + \ldots + r_m = c_1 + \ldots + c_n = N. \]



We are interested in the number $\#(R, C)$ of $m \times n$ non-negative integer matrices,
also known as contingency tables, with row sums $R$ and column sums $C$, called margins.
Computing or estimating the numbers $\#(R, C)$ has attracted a lot of attention
because of the relevance of these numbers in statistics, see [Goo76], [DE85],
combinatorics, representation theory, and elsewhere, see [DG85], [DG04]. Of interest
are asymptotic formulas, see [BBK72], [Ben74] and the more recent [CM07a], [GM07],
algorithms with rigorous estimates of the performance guarantees, see [DKM97],
[Mor02], [CD03], [BLV04], and heuristic approaches which may lack formal justification
but tend to work well in practice [Goo76], [DE85], [C+05].
Our first main result is as follows. 

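(1.1) Theorem. Let $R = (r_1, \ldots, r_m)$ and $C = (c_1, \ldots, c_n)$ be positive integer
vectors such that $r_1 + \ldots + r_m = c_1 + \ldots + c_n = N$. Let us define a function

\[ F(x, y) = \left( \prod_{i=1}^m x_i^{-r_i} \right) \left( \prod_{j=1}^n y_j^{-c_j} \right) \prod_{i,j} \frac{1}{1 - x_i y_j} \]

for $x = (x_1, \ldots, x_m)$ and $y = (y_1, \ldots, y_n)$.
Then $F(x, y)$ attains its minimum

\[ \rho = \rho(R, C) = \min_{\substack{0 < x_1, \ldots, x_m < 1 \\ 0 < y_1, \ldots, y_n < 1}} F(x, y) \]

on the open cube $0 < x_i, y_j < 1$ and for the number $\#(R, C)$ of non-negative integer
$m \times n$ matrices with row sums $R$ and column sums $C$ we have

\[ \rho \ \ge\ \#(R, C) \ \ge\ N^{-\gamma(m+n)} \rho, \]

where $\gamma > 0$ is an absolute constant.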

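More precisely, the lower bound we prove is

\[ \#(R, C) \ \ge\ \frac{\rho\, \Gamma\left( \frac{m+n}{2} \right)}{2 e^5 \pi^{(m+n-2)/2}\, mn (N + mn)}
   \left( \frac{2}{(mn)^2 (N + 1)(N + mn)} \right)^{m+n-1} \]

\[ \times \left( \prod_{i=1}^m \frac{r_i^{r_i}}{r_i!} \right) \left( \prod_{j=1}^n \frac{c_j^{c_j}}{c_j!} \right)
   \frac{N!\, (N + mn)!\, (mn)^{mn}}{N^N (N + mn)^{N + mn} (mn)!}, \]
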
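provided $m + n > 10$. Recall that from Stirling's formula

\[ \frac{s!}{s^s} = \sqrt{2 \pi s}\, e^{-s} \left( 1 + O\left( \frac{1}{s} \right) \right) \]

and hence the product in front of $\rho(R, C)$ indeed exceeds $N^{-\gamma(m+n)}$ for some
absolute constant $\gamma > 0$.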

We note that the substitution $x_i = e^{-t_i}$ and $y_j = e^{-s_j}$ transforms the problem
of computing $\rho$ into the problem of minimizing the convex function

\[ \phi(t, s) = \phi_{R,C}(t, s) = \sum_{i=1}^m r_i t_i + \sum_{j=1}^n c_j s_j - \sum_{i,j} \ln \left( 1 - e^{-t_i - s_j} \right) \]

on the positive orthant $t_i, s_j > 0$, so that methods of convex optimization can be
applied to compute $\rho$ in time polynomial in $m + n$ and $\ln N$, see [NN94].
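
As an illustration only, the minimization can be sketched in a few lines of Python. The sketch below does not use the interior-point methods of [NN94]; it swaps in a simple alternating scheme, minimizing exactly over $x$ for fixed $y$ and then over $y$ for fixed $x$, each coordinate by bisection on the stationarity condition $\sum_j x_i y_j / (1 - x_i y_j) = r_i$ (and similarly for columns). The function names `log_rho`, `_fit` and `_scaled_sum` are ours, and the number of sweeps is an arbitrary choice. Since $F$ is unchanged under $(x, y) \mapsto (\lambda x, \lambda^{-1} y)$, the sketch only enforces $x_i y_j < 1$ and does not normalize the point into the unit cube.

```python
import math


def _scaled_sum(u, others):
    """Return sum_j u*o_j / (1 - u*o_j), or +inf outside the domain u*o_j < 1."""
    total = 0.0
    for o in others:
        p = u * o
        if p >= 1.0:
            return math.inf
        total += p / (1.0 - p)
    return total


def _fit(target, others, steps=100):
    """Bisection for the u in (0, 1/max(others)) with _scaled_sum(u, others) == target."""
    lo, hi = 0.0, 1.0 / max(others)
    for _ in range(steps):
        mid = 0.5 * (lo + hi)
        if _scaled_sum(mid, others) < target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def log_rho(R, C, sweeps=200):
    """Approximate ln rho(R, C) by alternating exact minimization over x and over y."""
    assert sum(R) == sum(C)
    x = [0.5] * len(R)
    y = [0.5] * len(C)
    for _ in range(sweeps):
        x = [_fit(r, y) for r in R]   # row conditions hold for the current y
        y = [_fit(c, x) for c in C]   # column conditions hold for the new x
    value = -sum(r * math.log(u) for r, u in zip(R, x))
    value -= sum(c * math.log(v) for c, v in zip(C, y))
    value -= sum(math.log(1.0 - u * v) for u in x for v in y)
    return value
```

For constant margins such as $R = (3, 3)$, $C = (2, 2, 2)$ the scheme stabilizes after one sweep and returns $12 \ln 2$, so $\rho = 4096$, while the number of tables is 7. Any point with $x_i y_j < 1$ gives an upper bound on $\#(R, C)$, so the returned value is a valid bound even if the iteration has not fully converged.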

Theorem 1.1 estimates the number $\#(R, C)$ of contingency tables within an
$N^{O(m+n)}$ factor. This estimate provides, asymptotically, the main term
of $\log \#(R, C)$ for all but very sparse cases, where the margins $r_i$ and $c_j$ are small
compared to the sizes $m$ and $n$ of the matrix. For example, if the margins $r_i$ and $c_j$
are at least linear in $m$ and $n$ then $\#(R, C)$ is at least as big as $\gamma^{mn}$ for some
constant $\gamma > 1$. By now, the sparse case of small $r_i$ and $c_j$ is well understood, thanks
especially to the recent paper [GM07]. The case of moderate to high margins seems
to be the most difficult. To the author's knowledge, the estimate of Theorem 1.1
is the only rigorously proven effective estimate of $\#(R, C)$ for generic $R$ and $C$ (if
all $r_i$'s are equal and all $c_j$'s are equal, the recent paper [CM07a] provides a precise
asymptotic formula for the number of tables). Theorem 1.1 allows us to find faults
with the very intuitive "independence heuristic" for counting contingency tables
and points to some strange "attraction" phenomena in the space of matrices.
Quite counter-intuitively, we conclude that in the uniform probability space of the
$m \times n$ non-negative integer matrices with the total sum of entries equal to $N$, the
event consisting of the matrices with row sums $R$ and the event consisting of the
matrices with column sums $C$ attract exponentially in $mn$ provided the vectors
$R$ and $C$ are sufficiently far from constant vectors, see Section 2 for the precise
statements and details.

Let us identify the space of $m \times n$ real matrices $X = (x_{ij})$ with Euclidean space
$\mathbb{R}^d$ for $d = mn$. In $\mathbb{R}^d$ we consider the transportation polytope $\mathcal{P} = \mathcal{P}(R, C)$ defined
by the equations

\[ \sum_{j=1}^n x_{ij} = r_i \quad \text{for } i = 1, \ldots, m, \qquad \sum_{i=1}^m x_{ij} = c_j \quad \text{for } j = 1, \ldots, n \]

and inequalities

\[ x_{ij} \ge 0 \quad \text{for all } i, j. \]

As is known, $\mathcal{P}$ is a polytope of dimension $(m - 1)(n - 1)$. We prove the following
estimate for the volume of $\mathcal{P}$, computed with respect to the Euclidean structure in
the affine span of $\mathcal{P}$ induced from $\mathbb{R}^d$.

(1.2) Theorem. Let $R = (r_1, \ldots, r_m)$ and $C = (c_1, \ldots, c_n)$ be positive integer
vectors such that $r_1 + \ldots + r_m = c_1 + \ldots + c_n = N$ and let $\mathcal{P} = \mathcal{P}(R, C)$ be the
polytope of non-negative $m \times n$ matrices with row sums $r_1, \ldots, r_m$ and column sums
$c_1, \ldots, c_n$.

Let

\[ \beta = \beta(R, C) = \max_{X \in \mathcal{P}} \prod_{i,j} x_{ij} \]

be the maximum value of the product of entries of a matrix from $\mathcal{P}$. Then for the
volume of $\mathcal{P}$ we have


where $\gamma > 0$ is an absolute constant.



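From our proof more precise bounds

\[ \operatorname{vol}(\mathcal{P}) \ \ge\ \frac{\Gamma\left( \frac{m+n}{2} \right) (mn)^{mn}\, \beta}{2 e^3 \sqrt{mn}\, \pi^{(m+n-2)/2}\, N^{m+n-1} (mn)!} \]

and

\[ \operatorname{vol}(\mathcal{P}) \ \le\ \frac{2 e A^{m+n-2} (mn)^{2m+2n-5/2} (mn)^{mn}\, \beta}{N^{m+n-1} (mn)!}, \]

where

\[ A = A(R, C) = \frac{n}{2} \max_{i=1,\ldots,m} \frac{N}{r_i} + \frac{m}{2} \max_{j=1,\ldots,n} \frac{N}{c_j}, \]

follow. When the margins are scaled, $(R, C) \longmapsto (tR, tC)$ for $t > 0$, the volume of
$\mathcal{P}$ and both the upper and the lower bounds get multiplied by $t^{(m-1)(n-1)}$.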
Computing $\beta$ reduces to finding the maximum of the concave function

\[ f(X) = \sum_{i,j} \ln x_{ij} \]

on the transportation polytope $\mathcal{P}$ and hence can be done efficiently (in time polynomial
in $m + n$ and $\ln N$) by existing methods [NN94].

Computing or estimating volumes of transportation polytopes has attracted considerable
attention as a testing ground for methods of convex geometry [Sch92],
combinatorics [Pak00], analysis and algebra [BLV04], [BP03], [DLY03]. In a recent
breakthrough [CM07b], Canfield and McKay obtained a precise asymptotic expression
for the volume of the Birkhoff polytope (when $r_i = c_j = 1$ for all $i$ and $j$) and
in the more general case of all the row sums being equal and all the column sums
being equal. If $R = C = (1, \ldots, 1)$, the formula of [CM07b] gives

\[ \operatorname{vol} \mathcal{P} = \frac{1}{(2\pi)^{n - 1/2}\, n^{(n-1)^2}} \exp\left\{ \frac{1}{3} + n^2 + o(1) \right\} \]



whereas the formula of Theorem 1.2 implies that, ignoring lower-order terms, we 
have 



vol 7^ 



in that case (since by symmetry the maximum $\beta$ of the product of coordinates
$x_{ij}$ is attained at $x_{ij} = 1/n$). Theorem 1.2 seems to be the only rigorously proven
estimate of the volume of the transportation polytope available for general margins. 

We note that from the purely algorithmic perspective, volumes of polytopes and 
convex bodies can be computed in randomized polynomial time, see [Bol97] for a 
survey. 
Theorem 1.1 can be extended to counting with weights.

Let us fix a non-negative matrix $W = (w_{ij})$, which we call the matrix of weights.
We consider the following expression

\[ T(R, C; W) = \sum_{D = (d_{ij})} \prod_{i,j} w_{ij}^{d_{ij}}, \]

where the sum is taken over all non-negative integer matrices $D$ with row sums $R$
and column sums $C$ and where we agree that $0^0 = 1$. For example, if $w_{ij} \in \{0, 1\}$
for all $i, j$ then $T(R, C; W)$ is the number of $m \times n$ non-negative integer matrices
$D = (d_{ij})$ with row sums $R$, column sums $C$ and such that $d_{ij} = 0$ whenever
$w_{ij} = 0$. This number can also be interpreted as the number of integer feasible
flows in a bipartite graph with vertices $u_1, \ldots, u_m$ and $v_1, \ldots, v_n$ and edges $(u_i, v_j)$
whenever $w_{ij} = 1$ that satisfy the supply constraints $r_i$ at $u_i$ and the demand
constraints $c_j$ at $v_j$. Counting integer feasible flows in non-bipartite networks can
be reduced to that for bipartite networks. For example, if $w_{ij} = 1$ for $j < i + 1$ and
$w_{ij} = 0$ elsewhere, $T(R, C; W)$ is the Kostant partition function, see [Ba07], [Ba08]
for more examples and details. We also note that $T(R, C; \mathbf{1}) = \#(R, C)$, where $\mathbf{1}$
is the matrix of all 1's.
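
For very small margins the weighted sum can be checked by direct enumeration. The following Python sketch (our own illustration, exponential in the size of the table and not part of the method of this paper) fills the table row by row and accumulates $\prod w_{ij}^{d_{ij}}$; the name `weighted_table_sum` is hypothetical.

```python
def weighted_table_sum(R, C, W):
    """Brute-force T(R, C; W): sum over tables D of prod w_ij ** d_ij, with 0 ** 0 == 1."""
    m = len(R)
    assert sum(R) == sum(C)

    def compositions(total, caps):
        """Yield tuples d with sum(d) == total and 0 <= d[j] <= caps[j]."""
        if len(caps) == 1:
            if total <= caps[0]:
                yield (total,)
            return
        for first in range(min(total, caps[0]) + 1):
            for rest in compositions(total - first, caps[1:]):
                yield (first,) + rest

    def rec(i, remaining):
        # remaining[j] is the part of column sum c_j not yet used by rows 0..i-1
        if i == m:
            return 1 if all(c == 0 for c in remaining) else 0
        total = 0
        for row in compositions(R[i], remaining):
            weight = 1
            for j, d in enumerate(row):
                weight *= W[i][j] ** d  # Python evaluates 0 ** 0 as 1
            if weight:
                rest = tuple(c - d for c, d in zip(remaining, row))
                total += weight * rec(i + 1, rest)
        return total

    return rec(0, tuple(C))
```

With all weights equal to 1 this counts contingency tables, for instance 7 tables for $R = (3, 3)$, $C = (2, 2, 2)$; setting $w_{21} = 0$ for $R = (4, 1)$, $C = (2, 3)$ leaves a single table.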

We prove the following extension of Theorem 1.1. 

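(1.3) Theorem. Let $R = (r_1, \ldots, r_m)$ and $C = (c_1, \ldots, c_n)$ be positive integer
vectors such that $r_1 + \ldots + r_m = c_1 + \ldots + c_n = N$ and let $W = (w_{ij})$ be an $m \times n$
non-negative matrix of weights. Let us define a function

\[ F(x, y; W) = \left( \prod_{i=1}^m x_i^{-r_i} \right) \left( \prod_{j=1}^n y_j^{-c_j} \right) \prod_{i,j} \frac{1}{1 - w_{ij} x_i y_j} \]

for $x = (x_1, \ldots, x_m)$ and $y = (y_1, \ldots, y_n)$ and let

\[ \rho = \rho(R, C; W) = \inf_{\substack{x_1, \ldots, x_m > 0 \\ y_1, \ldots, y_n > 0 \\ w_{ij} x_i y_j < 1 \text{ for all } i, j}} F(x, y; W). \]

Then, for the number $T(R, C; W)$ of weighted non-negative integer matrices with
row sums $r_1, \ldots, r_m$ and column sums $c_1, \ldots, c_n$, we have

\[ \rho \ \ge\ T(R, C; W) \ \ge\ N^{-\gamma(m+n)} \rho, \]

where $\gamma > 0$ is an absolute constant.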

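More precisely, the lower bound we prove is

\[ T(R, C; W) \ \ge\ \frac{\rho\, \Gamma\left( \frac{m+n}{2} \right)}{2 e^5 \pi^{(m+n-2)/2}\, mn (N + mn)}
   \left( \frac{2}{(mn)^2 (N + 1)(N + mn)} \right)^{m+n-1} \]

\[ \times \left( \prod_{i=1}^m \frac{r_i^{r_i}}{r_i!} \right) \left( \prod_{j=1}^n \frac{c_j^{c_j}}{c_j!} \right)
   \frac{N!\, (N + mn)!\, (mn)^{mn}}{N^N (N + mn)^{N + mn} (mn)!}, \]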
provided $m + n > 10$.

As in Theorem 1.1, substituting $x_i = e^{-t_i}$ for $i = 1, \ldots, m$ and $y_j = e^{-s_j}$ for
$j = 1, \ldots, n$ we reduce the problem of computing $\rho$ to the problem of finding the
infimum of the convex function

\[ \phi(t, s) = \phi_{R,C;W}(t, s) = \sum_{i=1}^m r_i t_i + \sum_{j=1}^n c_j s_j - \sum_{i,j} \ln \left( 1 - w_{ij} e^{-t_i - s_j} \right) \]

on the convex polyhedron

\[ t_i + s_j > \ln w_{ij} \quad \text{for all } i, j. \]

Again, the value of $\rho$ can be computed efficiently, both in theory and in practice,
by methods of convex optimization, cf. [NN94].

For positive matrices $W = (w_{ij})$ the infimum $\rho(R, C; W)$ in Theorem 1.3 is attained
at a particular point and there is a convenient dual description of $\rho(R, C; W)$.

(1.4) Lemma. Let $\mathcal{P} = \mathcal{P}(R, C)$ be the transportation polytope of the $m \times n$ non-negative
matrices $X = (x_{ij})$ with row sums $R$ and column sums $C$ and let us fix an
$m \times n$ positive matrix $W = (w_{ij})$ of weights, so $w_{ij} > 0$ for all $i, j$. For an $m \times n$
non-negative matrix $X = (x_{ij})$ let us define

\[ g(X; W) = \sum_{i,j} \Bigl( (x_{ij} + 1) \ln (x_{ij} + 1) - x_{ij} \ln x_{ij} + x_{ij} \ln w_{ij} \Bigr). \]

Then $g(X; W)$ is a strictly concave function of $X$ and attains its maximum on $\mathcal{P}$
at a unique positive matrix $Z = Z(R, C; W)$. One can write $Z = (z_{ij})$ in the form

\[ z_{ij} = \frac{w_{ij} \xi_i \eta_j}{1 - w_{ij} \xi_i \eta_j} \quad \text{for all } i, j \]

and positive $\xi_1, \ldots, \xi_m; \eta_1, \ldots, \eta_n$ such that $w_{ij} \xi_i \eta_j < 1$ for all $i$ and $j$ and such
that the infimum $\rho(R, C; W)$ in Theorem 1.3 is attained at $x^* = (\xi_1, \ldots, \xi_m)$ and
$y^* = (\eta_1, \ldots, \eta_n)$:

\[ \rho(R, C; W) = F(x^*, y^*; W). \]

Moreover, we have

\[ \rho(R, C; W) = \exp\{ g(Z; W) \}. \]

In particular, if $w_{ij} = 1$ for all $i, j$ then

\[ g(X) = g(X; \mathbf{1}) = \sum_{i,j} \Bigl( (x_{ij} + 1) \ln (x_{ij} + 1) - x_{ij} \ln x_{ij} \Bigr) \quad \text{and} \quad
   z_{ij} = \frac{\xi_i \eta_j}{1 - \xi_i \eta_j} \quad \text{for all } i, j, \]

where $x^* = (\xi_1, \ldots, \xi_m)$ and $y^* = (\eta_1, \ldots, \eta_n)$ is a point where the minimum
$\rho(R, C) = F(x^*, y^*)$ in Theorem 1.1 is attained. Additionally,

\[ \rho(R, C) = \exp\{ g(Z) \}. \]
The paper is structured as follows. 

In Section 2, we consider consequences of Theorems 1.1 and 1.2 for the "independence
heuristic". The heuristic was, apparently, first discussed by Good, see
[Goo76]. It asserts that if we consider the space of non-negative integer $m \times n$
matrices with the total sum $N$ of entries as a probability space with the uniform
measure then the event consisting of the matrices with the row sums $r_1, \ldots, r_m$ is
"almost independent" from the event consisting of the matrices with the column
sums $c_1, \ldots, c_n$. We show that if the row sums $r_i$ and the column sums $c_j$ are sufficiently
generic then the independence heuristic tends to underestimate the number
of tables as badly as within a factor of $\gamma^{mn}$ for some absolute constant $\gamma > 1$.
We see that in fact (rather counter-intuitively), instead of independence, we have
attraction (positive correlation) of the events.

In Section 3, we state a general result (Theorem 3.1), which provides a reasonably
accurate estimate for the volume of the section of the standard simplex by a
subspace of a small codimension. Theorem 3.1 states that in a sufficiently generic
situation the volume of the section is determined by the maximum value of the
product of the coordinates of a point in the section. This estimate immediately
implies Theorem 1.2 and is one of the two crucial ingredients in the proofs of
Theorems 1.1 and 1.3. Theorem 3.1 appears to be new and may be interesting in its
own right.

In Section 4, we state some preliminaries from convex geometry needed to prove 
Theorem 3.1. 

In Section 5, we prove Theorems 3.1 and 1.2. 

In Section 6, we describe the second main ingredient for the proofs of Theorems
1.1 and 1.3, the integral representation from [Ba07] and [Ba08] for the number
$\#(R, C)$ of tables and the number $T(R, C; W)$ of weighted tables.

In Section 7, we prove Theorems 1.1 and 1.3 and Lemma 1.4. 

In what follows, we use $\gamma$ to denote a positive constant.

2. The independence heuristic and the 
exponential attraction in the space of matrices 

(2.1) The independence heuristic. The following heuristic approach to counting
contingency tables was suggested by Good [Goo76]. Let us consider the space
of all $m \times n$ non-negative integer matrices with the total sum $N$ of entries as a
probability space with the uniform measure. Then the probability that a matrix
from this space has row sums $R = (r_1, \ldots, r_m)$ is exactly

\[ \binom{N + mn - 1}{mn - 1}^{-1} \prod_{i=1}^m \binom{r_i + n - 1}{n - 1}. \]

Similarly, the probability that a matrix has column sums $C = (c_1, \ldots, c_n)$ is exactly

\[ \binom{N + mn - 1}{mn - 1}^{-1} \prod_{j=1}^n \binom{c_j + m - 1}{m - 1}. \]

Assuming that the two events are almost independent, one estimates the number
$\#(R, C)$ of contingency tables by the independence heuristic $I(R, C)$:

(2.1.1) \[ I(R, C) = \binom{N + mn - 1}{mn - 1}^{-1} \prod_{i=1}^m \binom{r_i + n - 1}{n - 1} \prod_{j=1}^n \binom{c_j + m - 1}{m - 1}. \]

For example, if $m = n = 4$, $R = (220, 215, 93, 64)$, $C = (108, 286, 71, 127)$ with
$N = 592$ then

\[ \#(R, C) = 1225914276768514 \approx 1.226 \times 10^{15}, \]

see [DE85], while

\[ I(R, C) \approx 1.211 \times 10^{15}. \]
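
The independence heuristic is a ratio of binomial coefficients and can be evaluated exactly. A minimal Python sketch, assuming the product form of $I(R, C)$ described above (the total number of matrices times the two probabilities); the function name `independence_heuristic` is ours.

```python
from fractions import Fraction
from math import comb


def independence_heuristic(R, C):
    """Exact value of I(R, C) as a rational number."""
    m, n = len(R), len(C)
    N = sum(R)
    assert N == sum(C)
    rows = 1
    for r in R:
        rows *= comb(r + n - 1, n - 1)      # matrices of one row with sum r
    cols = 1
    for c in C:
        cols *= comb(c + m - 1, m - 1)      # matrices of one column with sum c
    total = comb(N + m * n - 1, m * n - 1)  # all m x n matrices with total sum N
    return Fraction(rows * cols, total)


estimate = independence_heuristic((220, 215, 93, 64), (108, 286, 71, 127))
print(float(estimate))
```

For the margins of the example above the printed value agrees with the quoted $1.211 \times 10^{15}$ to the digits shown and lies below the exact count from [DE85].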

Given margins $R = (r_1, \ldots, r_m)$ and $C = (c_1, \ldots, c_n)$ such that not all row sums
$r_i$ are equal and not all column sums $c_j$ are equal, we will construct a sequence of
margins $(R_k, C_k)$, where $R_k$ is a $km$-vector and $C_k$ is a $kn$-vector such that the
ratio $\#(R_k, C_k) / I(R_k, C_k)$ grows as $\gamma^{k^2}$ for some $\gamma = \gamma(R, C) > 1$.

(2.2) Cloning margins. Let us choose some margins $R = (r_1, \ldots, r_m)$ and $C =
(c_1, \ldots, c_n)$ such that $r_1 + \ldots + r_m = c_1 + \ldots + c_n = N$. For a positive integer $k$,
let us consider the new "clone" margins

\[ R_k = \bigl( \underbrace{k r_1, \ldots, k r_1}_{k \text{ times}}, \ldots, \underbrace{k r_m, \ldots, k r_m}_{k \text{ times}} \bigr), \qquad
   C_k = \bigl( \underbrace{k c_1, \ldots, k c_1}_{k \text{ times}}, \ldots, \underbrace{k c_n, \ldots, k c_n}_{k \text{ times}} \bigr). \]

In other words, we obtain margins $(R_k, C_k)$ if we choose an arbitrary matrix $X$ with
row sums $R$ and column sums $C$, consider the $km \times kn$ block matrix $Y_k$ consisting
of $k^2$ blocks $X$ and let $R_k$ be the row sums of $Y_k$ and let $C_k$ be the column sums of
$Y_k$. Hence we consider $km \times kn$ matrices with the total sum of the matrix entries
equal to $k^2 N$.
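
The cloning construction is easy to state in code. The Python sketch below is only an illustration; the helper names `clone_margins` and `tile_blocks` are ours. Note that tiling $k^2$ copies of $X$ produces the entries of $R_k$ and $C_k$ up to the order in which rows and columns are listed.

```python
def clone_margins(R, C, k):
    """Return (R_k, C_k): each k*r_i and each k*c_j repeated k times."""
    Rk = [k * r for r in R for _ in range(k)]
    Ck = [k * c for c in C for _ in range(k)]
    return Rk, Ck


def tile_blocks(X, k):
    """Return the km x kn block matrix made of k*k copies of X."""
    return [row * k for _ in range(k) for row in X]


X = [[1, 0], [1, 1]]                  # row sums (1, 2), column sums (2, 1)
Rk, Ck = clone_margins([1, 2], [2, 1], 2)
Y = tile_blocks(X, 2)
print(Rk, Ck, sum(map(sum, Y)))       # total sum of Y is k^2 * N
```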
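One can check from the optimality condition (cf. Section 7.1) that if $x^* =
(\xi_1, \ldots, \xi_m)$ and $y^* = (\eta_1, \ldots, \eta_n)$ is a point in Theorem 1.1 where the minimum
$\rho(R, C)$ is attained then the minimum $\rho(R_k, C_k)$ is attained at the point

\[ x_k^* = \bigl( \underbrace{\xi_1, \ldots, \xi_1}_{k \text{ times}}, \ldots, \underbrace{\xi_m, \ldots, \xi_m}_{k \text{ times}} \bigr), \qquad
   y_k^* = \bigl( \underbrace{\eta_1, \ldots, \eta_1}_{k \text{ times}}, \ldots, \underbrace{\eta_n, \ldots, \eta_n}_{k \text{ times}} \bigr). \]

Therefore,

\[ \rho(R_k, C_k) = \rho(R, C)^{k^2} \]

and by Theorem 1.1

\[ \lim_{k \to +\infty} \#(R_k, C_k)^{1/k^2} = \rho(R, C), \]

or, in other words,

(2.2.1) \[ \lim_{k \to +\infty} \frac{1}{k^2} \ln \#(R_k, C_k) = \ln \rho(R, C). \]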



Let us introduce the multivariate entropy function

\[ H(p_1, \ldots, p_d) = \sum_{i=1}^d p_i \ln \frac{1}{p_i}, \]

where $p_1, \ldots, p_d$ are non-negative numbers such that $p_1 + \ldots + p_d = 1$. Using the
standard asymptotic estimate for binomial coefficients (available, for example, via
Stirling's formula)

\[ \lim_{k \to +\infty} \frac{1}{k} \ln \binom{ka + kb}{ka} = (a + b) \ln (a + b) - a \ln a - b \ln b \]

we deduce from (2.1.1) that

(2.2.2)
\[ \lim_{k \to +\infty} \frac{1}{k^2} \ln I(R_k, C_k) =
   - (N + mn) H\left( \frac{r_i + n}{N + mn},\ i = 1, \ldots, m \right)
   - (N + mn) H\left( \frac{c_j + m}{N + mn},\ j = 1, \ldots, n \right) \]
\[ \qquad - \sum_{i=1}^m r_i \ln r_i - \sum_{j=1}^n c_j \ln c_j + N \ln N + (N + mn) \ln (N + mn). \]
(2.3) The exponential attraction in the space of matrices. Let us choose
margins $R = (r_1, \ldots, r_m)$ and $C = (c_1, \ldots, c_n)$ such that not all row sums $r_i$ are
equal and not all column sums $c_j$ are equal. Our goal is to show that

(2.3.1) \[ \lim_{k \to +\infty} \frac{1}{k^2} \ln \#(R_k, C_k) \ >\ \lim_{k \to +\infty} \frac{1}{k^2} \ln I(R_k, C_k), \]

so the ratio $\#(R_k, C_k) / I(R_k, C_k)$ grows as $\gamma^{k^2}$ for some $\gamma = \gamma(R, C) > 1$, as we
clone margins $(R, C) \longmapsto (R_k, C_k)$.
By Lemma 1.4, we can write

(2.3.2) \[ \ln \rho(R, C) = g(Z) \ge g(Y), \]

where $Y = (y_{ij})$ is the independence matrix with $y_{ij} = r_i c_j / N$ for all $i, j$ and

\[ g(X) = \sum_{i,j} \Bigl( (x_{ij} + 1) \ln (x_{ij} + 1) - x_{ij} \ln x_{ij} \Bigr). \]

On the other hand, it is easy to check that

(2.3.3)
\[ g(Y) = - (N + mn) H\left( \frac{r_i c_j + N}{N (N + mn)},\ 1 \le i \le m,\ 1 \le j \le n \right)
   - \sum_{i=1}^m r_i \ln r_i - \sum_{j=1}^n c_j \ln c_j \]
\[ \qquad + N \ln N + (N + mn) \ln (N + mn). \]

Let us consider the $m \times n$ matrix with the $(i, j)$-th entry equal to
$(r_i c_j + N) / (N^2 + Nmn)$. The $i$-th row sum of the matrix is $(r_i + n) / (N + mn)$,
the $j$-th column sum is $(c_j + m) / (N + mn)$ while the sum of all the entries of
the matrix is 1. Using the inequality relating the entropies of two partitions of
a probability space with the entropy of the intersection of the partitions (see, for
example, [Khi57]), we conclude that

(2.3.4)
\[ H\left( \frac{r_i c_j + N}{N (N + mn)},\ 1 \le i \le m,\ 1 \le j \le n \right)
   \ \le\ H\left( \frac{r_i + n}{N + mn},\ 1 \le i \le m \right) + H\left( \frac{c_j + m}{N + mn},\ 1 \le j \le n \right) \]

with the equality if and only if

(2.3.5) \[ \frac{r_i + n}{N + mn} \cdot \frac{c_j + m}{N + mn} = \frac{r_i c_j + N}{N (N + mn)} \quad \text{for all } i, j. \]
Identities (2.3.5) are equivalent to $(N - r_i m)(N - c_j n) = 0$, which, in turn, is equivalent
to all row sums being equal $r_i = N/m$ or all column sums being equal $c_j = N/n$.
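
For the reader's convenience, here is the short computation behind this equivalence (our own expansion, obtained by clearing denominators in the product identity for the $(i, j)$-th entry):

```latex
\begin{align*}
\frac{r_i + n}{N + mn} \cdot \frac{c_j + m}{N + mn} = \frac{r_i c_j + N}{N (N + mn)}
  &\iff N (r_i + n)(c_j + m) = (r_i c_j + N)(N + mn) \\
  &\iff N r_i m + N n c_j = r_i c_j \, mn + N^2 \\
  &\iff (N - r_i m)(N - c_j n) = 0 .
\end{align*}
```

If some $r_i \ne N/m$, the last identity for that $i$ and every $j$ forces $c_j = N/n$ for all $j$, which gives the stated alternative.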

Summarizing (2.2.1), (2.2.2), (2.3.2), and (2.3.3) we conclude that inequality
(2.3.1) indeed holds if not all row sums $r_i$ are equal and not all column sums $c_j$ are
equal. Therefore, in the space of $km \times kn$ matrices with the sum $k^2 N$ of all entries
the two events

         $\mathcal{R}_k$ : the vector of row sums of a matrix is $R_k$
(2.3.6)  and
         $\mathcal{C}_k$ : the vector of column sums of a matrix is $C_k$,

instead of being asymptotically independent, attract exponentially in $k^2$, that is,

\[ \frac{\Pr(\mathcal{R}_k \cap \mathcal{C}_k)}{(\Pr \mathcal{R}_k)(\Pr \mathcal{C}_k)} \ \ge\ \gamma^{k^2} \]

for some $\gamma = \gamma(R, C) > 1$ and all sufficiently large $k$.

Starting with non-constant margins $(R, C)$ the cloning procedure $(R, C) \longmapsto
(R_k, C_k)$ produces margins which stay away from constant and maintain the
density $N/mn$ separated from 0. Similar analysis shows that the phenomenon
of attraction of the events $\mathcal{R}_k$ and $\mathcal{C}_k$ defined by (2.3.6) holds for more general
sequences of margins $(R_k, C_k)$ of growing dimensions which stay sufficiently away
from uniform and sparse.

Two terms contribute to the difference $\ln \#(R, C) - \ln I(R, C)$:

first, the difference $g(Z) - g(Y)$, where $Z$ is the matrix of Lemma 1.4 at which
the maximum of the function $g(X) = \sum_{i,j} \bigl( (x_{ij} + 1) \ln (x_{ij} + 1) - x_{ij} \ln x_{ij} \bigr)$ on the
transportation polytope $\mathcal{P}(R, C)$ is attained and $Y = (y_{ij})$ is the independence
matrix $y_{ij} = r_i c_j / N$, cf. (2.3.2);

and second, the difference (multiplied by $(N + mn)$) between the entropies on
the right hand side of (2.3.4) and the left hand side of (2.3.4).

As long as either of these differences remains large enough to overcome the error
term of $O((m + n) \ln N)$ coming from Theorem 1.1, we have the asymptotic positive
correlation of sequences of events $\mathcal{R}_k$ and $\mathcal{C}_k$ in (2.3.6).

On the other hand, the independence estimate $I(R, C)$ produces a reasonable
approximation to $\#(R, C)$ in the cases of sparse tables (cf. [GM07]) and tables
with constant margins (cf. [CM07a]). One can show that if all row sums $r_i$ are equal
or if all column sums $c_j$ are equal then indeed

\[ \lim_{k \to +\infty} \frac{1}{k^2} \ln \#(R_k, C_k) = \lim_{k \to +\infty} \frac{1}{k^2} \ln I(R_k, C_k), \]

where $(R_k, C_k)$ are cloned margins $(R, C)$. Indeed, if all $r_i$ are equal then the
symmetry argument shows that the matrix $Z = (z_{ij})$ in Lemma 1.4 satisfies $z_{ij} =
c_j / m$ for all $i$ and $j$, and, similarly, if all $c_j$ are equal then we have $z_{ij} = r_i / n$ for
all $i, j$. In either case we have $Z = Y$ in (2.3.2) and, as we have already discussed,
equations (2.3.5) hold as well.
3. The volume of a section of a simplex



Let $\mathcal{A}$ be the affine hyperplane in $\mathbb{R}^d$ defined by the equation

\[ \sum_{i=1}^d x_i = 1 \]

and let $\Delta \subset \mathcal{A}$ be the standard $(d - 1)$-dimensional open simplex defined by the
inequalities

\[ x_i > 0 \quad \text{for } i = 1, \ldots, d. \]

We consider the Euclidean structure in $\mathcal{A}$ induced from $\mathbb{R}^d$. In particular, if $K \subset \mathcal{A}$
is an $m$-dimensional convex body, by $\operatorname{vol}_m(K)$ we denote the $m$-dimensional volume
of $K$ with respect to that Euclidean structure. For $m = d - 1$ we denote $\operatorname{vol}_m$ just
by $\operatorname{vol}$. In particular,

\[ \operatorname{vol}(\Delta) = \frac{\sqrt{d}}{(d - 1)!}. \]

Let $L \subset \mathcal{A}$ be an affine subspace intersecting $\Delta$. Suppose that $\dim L = d - k - 1$,
so the codimension of $L$ in $\mathcal{A}$ is $k \ge 1$. Our aim is to estimate the volume of the
intersection $\operatorname{vol}_{d-k-1}(L \cap \Delta)$ within a reasonable accuracy when the codimension $k$
of $L$ is small. It turns out that the volume is controlled by one particular quantity,
namely the maximum value of the product of the coordinates of a point $x \in \Delta \cap L$.

Our result is as follows. 

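(3.1) Theorem. Let $\mathcal{A} \subset \mathbb{R}^d$ be the affine hyperplane defined by the equation
$x_1 + \ldots + x_d = 1$ and let $\Delta \subset \mathcal{A}$ be the standard $(d - 1)$-dimensional open simplex
defined by the inequalities $x_1 > 0, \ldots, x_d > 0$.

Let $L \subset \mathcal{A}$ be an affine subspace intersecting $\Delta$ and such that $\dim L = d - k - 1$
where $k \ge 1$. Suppose that the maximum of the function

\[ f(x) = \sum_{i=1}^d \ln x_i \]

on $\Delta \cap L$ is attained at $a = (\alpha_1, \ldots, \alpha_d)$.
(1) We have

\[ \frac{\operatorname{vol}_{d-k-1}(\Delta \cap L)}{\operatorname{vol}(\Delta)} \ \ge\ \frac{\gamma}{d^2 \omega_k}\, d^d e^{f(a)}, \]

where

\[ \omega_k = \frac{\pi^{k/2}}{\Gamma(k/2 + 1)} \]

is the volume of the $k$-dimensional unit ball and $\gamma > 0$ is an absolute constant
(one can choose $\gamma = 1/(2e^3) \approx 0.025$).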
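(2) Suppose that

\[ \alpha_i \ge \frac{\epsilon}{d} \quad \text{for some } \epsilon > 0 \text{ and } i = 1, \ldots, d. \]

Then

\[ \frac{\operatorname{vol}_{d-k-1}(\Delta \cap L)}{\operatorname{vol}(\Delta)} \ \le\ \gamma \left( \frac{d^2}{2 \epsilon} \right)^k d^d e^{f(a)}, \]

where $\gamma > 0$ is an absolute constant (one can choose $\gamma = 2e \approx 5.44$).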

We are interested in the situation of $k \sim \sqrt{d}$, so ignoring lower-order terms in
the logarithmic order, we get

\[ \operatorname{vol}_{d-k-1}(\Delta \cap L) \approx e^d e^{f(a)} = e^d \prod_{i=1}^d \alpha_i \]

provided the maximum value of the product of the coordinates of a point $x \in \Delta \cap L$
is attained at $a = (\alpha_1, \ldots, \alpha_d)$ and all $\alpha_i$ are not too small.
Let

\[ c = \left( \frac{1}{d}, \ldots, \frac{1}{d} \right) \]

be the center of the simplex $\Delta$.

We deduce Theorem 3.1 from the following result. 

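(3.2) Theorem. Let $\mathcal{A} \subset \mathbb{R}^d$ be the affine hyperplane defined by the equation
$x_1 + \ldots + x_d = 1$ and let $\Delta \subset \mathcal{A}$ be the standard $(d - 1)$-dimensional open simplex
defined by the inequalities $x_1 > 0, \ldots, x_d > 0$.

Let $H \subset \mathcal{A}$ be an affine hyperplane in $\mathcal{A}$ intersecting $\Delta$. If $H$ does not pass
through the center $c$ of $\Delta$, let $H^- \subset \mathcal{A}$ be the open halfspace bounded by $H$ that
does not contain $c$ and if $H$ passes through $c$ let $H^- \subset \mathcal{A}$ be either of the open
halfspaces bounded by $H$.

Suppose that the function

\[ f(x) = \sum_{i=1}^d \ln x_i \quad \text{where } x = (x_1, \ldots, x_d) \]

attains its maximum on $\Delta \cap H$ at a point $a = (\alpha_1, \ldots, \alpha_d)$.
Then, for some absolute constant $\gamma > 0$ we have

\[ d^d e^{f(a)} \ \ge\ \frac{\operatorname{vol}(\Delta \cap H^-)}{\operatorname{vol}(\Delta)} \ \ge\ \frac{\gamma}{d^2}\, d^d e^{f(a)}. \]

We can choose $\gamma = 1/(2e^3) \approx 0.025$.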
4. Preliminaries from convex geometry



We recall that $\mathcal{A} \subset \mathbb{R}^d$ is the affine hyperplane defined by the equation $x_1 +
\ldots + x_d = 1$, that $\Delta \subset \mathcal{A}$ is the open simplex defined by the inequalities $x_i > 0$ for
$i = 1, \ldots, d$, and that $c = (1/d, \ldots, 1/d)$ is the center of $\Delta$. We need some results
regarding central hyperplane sections of $\Delta$.

(4.1) Lemma. Let $H \subset \mathcal{A}$ be an affine hyperplane in $\mathcal{A}$ passing through the center
$c$ of $\Delta$.

(1) Let $H^+$ and $H^-$ be the open halfspaces bounded by $H$. Then for some
absolute constant $\gamma > 0$ we have

\[ \frac{\operatorname{vol}(\Delta \cap H^+)}{\operatorname{vol}(\Delta)} \ge \gamma \quad \text{and} \quad \frac{\operatorname{vol}(\Delta \cap H^-)}{\operatorname{vol}(\Delta)} \ge \gamma. \]

One can choose $\gamma = 1/e \approx 0.37$.

(2) For some absolute constant $\gamma > 0$ we have

\[ \frac{\operatorname{vol}_{d-2}(\Delta \cap H)}{\operatorname{vol}(\Delta)} \ge \gamma. \]

One can choose $\gamma = 1/(2e) \approx 0.18$.

Proof. Part (1) is a particular case of a more general result of Grünbaum [Grü60]
on hyperplane sections through the centroid of a convex body. In fact, in dimension
$d$ one can choose

\[ \gamma = \left( \frac{d}{d + 1} \right)^d. \]

As K. Ball and M. Fradelizi explained to the author, a stronger estimate than
that of Part (2) can be obtained by combining techniques of [Bal88] and [Frad97].
Nevertheless, we present a proof of Part (2) below since the same approach is used
later in the proof of Theorem 3.1.

To prove Part (2), let $H^\perp \subset \mathcal{A}$ be a line orthogonal to $H$. Let us consider
the orthogonal projection $pr : \mathcal{A} \to H^\perp$ and let $Q = pr(\Delta)$ be the image of the
simplex. Since $\Delta$ is contained in a ball of radius 1, $Q$ is an interval of length at
most 2.

Let $y_0 = pr(H)$ and for $y \in Q$ let

\[ \nu(y) = \operatorname{vol}_{d-2}\left( pr^{-1}(y) \right) \]

be the volume of the inverse image of $y$. By the Brunn-Minkowski inequality, the
function $\nu$ is log-concave, see [Bal88], [Bal97].

Our goal is to bound $\nu(y_0)$ from below. The point $y_0$ splits the interval $Q$ into
two subintervals, $Q^+ = pr(\Delta \cap H^+)$ and $Q^- = pr(\Delta \cap H^-)$ of length at most 2
each.
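We have

\[ \int_{Q^+} \nu(y)\, dy = \operatorname{vol}(\Delta \cap H^+) \quad \text{and} \quad \int_{Q^-} \nu(y)\, dy = \operatorname{vol}(\Delta \cap H^-). \]

Using Part (1) we conclude that there exist $y^+ \in Q^+$ and $y^- \in Q^-$ such that

\[ \frac{\nu(y^+)}{\operatorname{vol}(\Delta)} \ge \frac{1}{2e} \quad \text{and} \quad \frac{\nu(y^-)}{\operatorname{vol}(\Delta)} \ge \frac{1}{2e}. \]

Since $y_0$ is a convex combination of $y^+$ and $y^-$, by the log-concavity of $\nu$ we
must have

\[ \frac{\nu(y_0)}{\operatorname{vol}(\Delta)} \ge \frac{1}{2e}, \]

as desired. □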

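Let us choose a point $a = (\alpha_1, \ldots, \alpha_d)$ in $\Delta$, and let us consider the projective
transformation $T_a : \Delta \to \Delta$,

\[ T_a(x) = y \quad \text{where} \quad y_i = \frac{\alpha_i x_i}{\alpha_1 x_1 + \ldots + \alpha_d x_d} \quad \text{for } i = 1, \ldots, d, \]

$x = (x_1, \ldots, x_d)$ and $y = (y_1, \ldots, y_d)$.
The inverse transformation is $T_b$ for $b = (\alpha_1^{-1}, \ldots, \alpha_d^{-1})$. Clearly,

\[ T_a(c) = a, \]

where $c$ is the center of $\Delta$. For $x \in \Delta$, the derivative $DT_a(x)$ is a linear transformation

\[ DT_a(x) : \mathcal{H} \to \mathcal{H}, \]

where $\mathcal{H}$ is the hyperplane $x_1 + \ldots + x_d = 0$ in $\mathbb{R}^d$.

Our immediate goal is to compute the Jacobian $|DT_a(x)|$ at $x \in \Delta$.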

(4.2) Lemma. Let us choose a point $a = (\alpha_1, \ldots, \alpha_d)$ in the simplex $\Delta$ and let us
consider the projective transformation

\[ T_a : \Delta \to \Delta \]

defined by the formula

\[ T_a(x) = y \quad \text{where} \quad y_i = \frac{\alpha_i x_i}{\alpha_1 x_1 + \ldots + \alpha_d x_d} \quad \text{for } i = 1, \ldots, d, \]

for $x = (x_1, \ldots, x_d)$ and $y = (y_1, \ldots, y_d)$.
Let $DT_a(x) : \mathcal{H} \to \mathcal{H}$ be the derivative of $T_a$ at $x \in \Delta$ and $|DT_a(x)|$ the
corresponding value of the Jacobian. Then

\[ |DT_a(x)| = \frac{\alpha_1 \cdots \alpha_d}{(\alpha_1 x_1 + \ldots + \alpha_d x_d)^d}. \]



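Proof. Let us consider $T_a$ as defined in a neighborhood of $x$ in $\mathbb{R}^d$ with values in
$\mathbb{R}^d$ and let $DT_a(x)$ be the $d \times d$ matrix of the derivative

\[ DT_a(x) = \left( \frac{\partial y_i}{\partial x_j} \right) \]

in the standard basis of $\mathbb{R}^d$. Then the $i$-th diagonal entry of $DT_a(x)$ is

\[ \frac{\alpha_i}{\alpha_1 x_1 + \ldots + \alpha_d x_d} - \frac{\alpha_i^2 x_i}{(\alpha_1 x_1 + \ldots + \alpha_d x_d)^2}, \]

while the $(i, j)$-th entry for $i \ne j$ is

\[ - \frac{\alpha_i \alpha_j x_i}{(\alpha_1 x_1 + \ldots + \alpha_d x_d)^2}. \]

Let

\[ \beta = \frac{1}{\alpha_1 x_1 + \ldots + \alpha_d x_d}, \]

let $B$ be the diagonal matrix with the diagonal entries $\alpha_1, \ldots, \alpha_d$, and let $C$ be the
matrix with the $(i, j)$-th entry equal to $\alpha_i \alpha_j x_i$ for all $1 \le i, j \le d$. Then we can
write

\[ DT_a(x) = \beta B - \beta^2 C = \beta (B - \beta C). \]

Since $DT_a(x)$ maps $\mathbb{R}^d$ onto $\mathcal{H}$ and $\mathcal{H}$ is an invariant subspace of $DT_a(x)$, the
value of the Jacobian we are interested in is the product of the $(d - 1)$ non-zero
eigenvalues of $DT_a(x)$ (counting algebraic multiplicities), which is equal to the
$(d - 1)$-st elementary symmetric function in the eigenvalues of $DT_a(x)$, which is
equal to the sum of the $d$ of the $(d - 1) \times (d - 1)$ principal minors of $DT_a(x)$.

Let $B_i$ and $C_i$ be the $(d - 1) \times (d - 1)$ matrices obtained from $B$ and $C$ respectively
by crossing out the $i$-th row and column.

Hence

\[ B_i - \beta C_i = B_i \left( I - \beta B_i^{-1} C_i \right), \]

where $I$ is the $(d - 1) \times (d - 1)$ identity matrix. Now $B_i^{-1} C_i$ is a matrix of rank 1
with the non-zero eigenvalue equal to the trace of $B_i^{-1} C_i$, which is

\[ \sum_{j \ne i} \alpha_j x_j. \]

Hence

\[ \det \left( I - \beta B_i^{-1} C_i \right) = 1 - \beta \sum_{j \ne i} \alpha_j x_j = \frac{\alpha_i x_i}{\alpha_1 x_1 + \ldots + \alpha_d x_d} \]

and

\[ \det \left( B_i - \beta C_i \right) = \frac{\alpha_1 \cdots \alpha_d\, x_i}{\alpha_1 x_1 + \ldots + \alpha_d x_d}. \]

Therefore, since $x_1 + \ldots + x_d = 1$, the sum of the $d$ of $(d - 1) \times (d - 1)$ principal
minors of $B - \beta C$ is

\[ \frac{\alpha_1 \cdots \alpha_d}{\alpha_1 x_1 + \ldots + \alpha_d x_d} \]

and the sum of the $(d - 1) \times (d - 1)$ principal minors of $DT_a(x) = \beta (B - \beta C)$ is

\[ \frac{\alpha_1 \cdots \alpha_d}{(\alpha_1 x_1 + \ldots + \alpha_d x_d)^d}, \]

as desired. □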
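
The Jacobian formula of Lemma 4.2 can be sanity-checked numerically. The Python sketch below is only an illustration for $d = 3$: it differentiates $T_a$ by central differences along an orthonormal basis of the direction space $x_1 + x_2 + x_3 = 0$ and compares the resulting $2 \times 2$ determinant with $\alpha_1 \alpha_2 \alpha_3 / (\alpha_1 x_1 + \alpha_2 x_2 + \alpha_3 x_3)^3$. The function names, the step size `h` and the sample point are our choices.

```python
import math


def T(a, x):
    """Projective transformation T_a: y_i = a_i x_i / (a_1 x_1 + ... + a_d x_d)."""
    s = sum(ai * xi for ai, xi in zip(a, x))
    return [ai * xi / s for ai, xi in zip(a, x)]


def jacobian_on_simplex(a, x, h=1e-6):
    """Finite-difference Jacobian of T_a restricted to the plane x1 + x2 + x3 = 1."""
    # Orthonormal basis of the direction space {v : v1 + v2 + v3 = 0}.
    u = [1 / math.sqrt(2), -1 / math.sqrt(2), 0.0]
    v = [1 / math.sqrt(6), 1 / math.sqrt(6), -2 / math.sqrt(6)]
    basis = [u, v]
    cols = []
    for e in basis:
        plus = T(a, [xi + h * ei for xi, ei in zip(x, e)])
        minus = T(a, [xi - h * ei for xi, ei in zip(x, e)])
        diff = [(p - q) / (2 * h) for p, q in zip(plus, minus)]
        # Coordinates of the directional derivative in the same basis.
        cols.append([sum(d * b for d, b in zip(diff, bvec)) for bvec in basis])
    return abs(cols[0][0] * cols[1][1] - cols[0][1] * cols[1][0])


def jacobian_formula(a, x):
    """Closed form of Lemma 4.2: a_1 ... a_d / (a_1 x_1 + ... + a_d x_d)^d."""
    s = sum(ai * xi for ai, xi in zip(a, x))
    return math.prod(a) / s ** len(a)


a = [0.2, 0.3, 0.5]
x = [0.5, 0.25, 0.25]
print(jacobian_on_simplex(a, x), jacobian_formula(a, x))
```

For this sample point $\alpha_1 x_1 + \alpha_2 x_2 + \alpha_3 x_3 = 0.3$, so the closed form equals $0.03 / 0.027$, and the two printed numbers should agree up to the finite-difference error.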

Next, we will need a technical estimate, which shows that if the volume of the
section of the simplex by an affine subspace of a small codimension is sufficiently
large and if the subspace cuts sufficiently deep into the simplex then a neighborhood
of the section in the simplex has a sufficiently large volume.

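(4.3) Lemma. Let $L \subset \mathcal{A}$ be an affine subspace, $\dim L = d - k - 1$. Suppose that
there is a point $a \in L \cap \Delta$, $a = (\alpha_1, \ldots, \alpha_d)$ such that

\[ \alpha_i \ge \frac{\epsilon}{d} \quad \text{for } i = 1, \ldots, d \]

and some $\epsilon > 0$.
Let

\[ \|x\|_\infty = \max\{ |x_i| \ \text{for } i = 1, \ldots, d \} \quad \text{for } x = (x_1, \ldots, x_d) \]

and let us define a neighborhood $Q$ of $\Delta \cap L$ by

\[ Q = \left\{ x \in \Delta : \ \|x - y\|_\infty \le \frac{\epsilon}{d^2} \ \text{for some } y \in \Delta \cap L \right\}. \]

Then, for any affine hyperplane $H \subset \mathcal{A}$ passing through $L$ we have

\[ \operatorname{vol}(Q \cap H^+),\ \operatorname{vol}(Q \cap H^-) \ \ge\ \gamma \left( \frac{2 \epsilon}{d^2} \right)^k \operatorname{vol}_{d-k-1}(\Delta \cap L), \]

where $H^+$ and $H^-$ are the halfspaces bounded by $H$ and $\gamma > 0$ is an absolute
constant. One can choose $\gamma = 1/(2e) \approx 0.18$.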

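Proof. Let

\[ Q_0 = \left\{ \frac{1}{d}\, a + \frac{d - 1}{d}\, x : \ x \in \Delta \cap L \right\}. \]

Since $Q_0$ is the contraction of $\Delta \cap L$ we have

\[ \operatorname{vol}_{d-k-1} Q_0 = \left( \frac{d - 1}{d} \right)^{d-k-1} \operatorname{vol}_{d-k-1}(\Delta \cap L) \ \ge\ \frac{1}{e} \operatorname{vol}_{d-k-1}(\Delta \cap L). \]

Moreover, for any $x \in Q_0$, $x = (x_1, \ldots, x_d)$, we have

\[ x_i \ge \frac{\epsilon}{d^2} \quad \text{for } i = 1, \ldots, d. \]

For every point $x \in Q_0$ let us consider the cube

\[ I_x = \left\{ y \in \mathbb{R}^d : \ \|y - x\|_\infty \le \frac{\epsilon}{d^2} \right\}. \]

Then $(I_x \cap \mathcal{A}) \subset \Delta$. The intersection of $I_x$ with the $k$-dimensional affine subspace
$L^\perp \subset \mathcal{A}$ orthogonal to $L$ and passing through $x$ is centrally symmetric with respect
to $x$ and, by Vaaler's Theorem [Vaa79], satisfies

\[ \operatorname{vol}_k \left( I_x \cap L^\perp \right) \ \ge\ \left( \frac{2 \epsilon}{d^2} \right)^k. \]

The proof now follows. □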

5. Proofs of Theorems 1.2, 3.1, and 3.2 

We prove Theorem 3.2 first. 

(5.1) Proof of Theorem 3.2. If $c \in H$ the result follows by Lemma 4.1. Hence
we assume that $c \notin H$.

The hyperplane $H$ is orthogonal to the gradient of $f(x)$ at $x = a$ and passes
through $a$, from which it follows that $H$ can be defined in $\mathcal{A}$ by the equation

\[ \sum_{i=1}^d \frac{x_i}{\alpha_i} = d \]

while the halfspace $H^-$ is defined by the inequality

\[ \sum_{i=1}^d \frac{x_i}{\alpha_i} < d. \]

Let us consider the projective transformation $T_a : \Delta \to \Delta$ defined by the
formula of Lemma 4.2. Hence $T_a(c) = a$. Moreover, the inverse image $T_a^{-1}(H)$ is
the hyperplane $H_0$ defined in $\mathcal{A}$ by the equation

\[ \sum_{i=1}^d \alpha_i x_i = \frac{1}{d} \]

and the inverse image $T_a^{-1}(\Delta \cap H^-)$ is the intersection $\Delta \cap H_0^-$, where $H_0^-$ is the
halfspace defined by the inequality

\[ \sum_{i=1}^d \alpha_i x_i > \frac{1}{d}. \]

By Lemma 4.2, we have

\[ |DT_a(x)| = \frac{\alpha_1 \cdots \alpha_d}{(\alpha_1 x_1 + \ldots + \alpha_d x_d)^d} \ \le\ d^d \prod_{i=1}^d \alpha_i \quad \text{for all } x \in \Delta \cap H_0^-. \]

Since

(5.1.1) \[ \operatorname{vol}(\Delta \cap H^-) = \int_{\Delta \cap H_0^-} |DT_a(x)|\, dx, \]

the upper bound follows.

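Let us prove the lower bound. By Part (2) of Lemma 4.1,

\[ \frac{\operatorname{vol}_{d-2}(\Delta \cap H_0)}{\operatorname{vol}(\Delta)} \ \ge\ \frac{1}{2e}. \]

We recall that $H_0$ passes through the center of the simplex and apply Lemma 4.3
with $\epsilon = 1$. Namely, we define

\[ Q = \left\{ x \in \Delta : \ \|x - y\|_\infty \le \frac{1}{d^2} \ \text{for some } y \in \Delta \cap H_0 \right\} \]

and conclude that by Lemma 4.3

\[ \operatorname{vol}(Q \cap H_0^-) \ \ge\ \left( \frac{1}{2e} \right) \left( \frac{2}{d^2} \right) \operatorname{vol}_{d-2}(\Delta \cap H_0) \ \ge\ \frac{1}{2 e^2 d^2} \operatorname{vol}(\Delta). \]

We note that for every $x \in Q$ we have

\[ \sum_{i=1}^d \alpha_i x_i \ \le\ \frac{1}{d} + \frac{1}{d^2} = \frac{d + 1}{d^2}. \]

By (5.1.1)

\[ \operatorname{vol}(\Delta \cap H^-) \ \ge\ \int_{Q \cap H_0^-} |DT_a(x)|\, dx \ \ge\ \left( \frac{d^2}{d + 1} \right)^d \operatorname{vol}(Q \cap H_0^-) \prod_{i=1}^d \alpha_i, \]

which completes the proof. □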
Next, we prove Theorem 3.1. 
(5.2) Proof of Theorem 3.1. The proof of Part (1) is similar to that of Part (2)
of Lemma 4.1. Let $L^\perp \subset \mathcal{A}$ be a $k$-dimensional subspace orthogonal to $L$ in $\mathcal{A}$ and
let

\[ pr : \mathcal{A} \to L^\perp \]

be the orthogonal projection. Let $Q \subset L^\perp$, $Q = pr(\Delta)$, be the image of the simplex.
Clearly, $Q$ lies in a ball of radius 1, so

\[ \operatorname{vol}_k Q \le \omega_k. \]

For $y \in Q$, let

\[ \nu(y) = \operatorname{vol}_{d-k-1}\left( pr^{-1}(y) \right) \]

be the volume of the inverse image of $y$. By the Brunn-Minkowski inequality, the
function $\nu$ is log-concave, so for every $\alpha > 0$ the set

\[ \{ y \in Q : \ \nu(y) > \alpha \} \]

is convex. Moreover, for all Borel sets $Y \subset Q$ we have

\[ \int_Y \nu(y)\, dy = \operatorname{vol}\left( pr^{-1}(Y) \right). \]

We want to estimate $\nu(y_0)$ for $y_0 = pr(L)$. Let $H \subset L^\perp$ be an affine hyperplane
through $y_0$ and let $H^+, H^- \subset L^\perp$ be open halfspaces bounded by $H$. Then $\hat{H} =
pr^{-1}(H)$ is an affine hyperplane in $\mathcal{A}$ containing $L$ and $pr^{-1}(H^-)$ and $pr^{-1}(H^+)$
are the corresponding open halfspaces of $\mathcal{A}$ bounded by $\hat{H}$.

Since the maximum value of $f$ on $\Delta \cap \hat{H}$ is at least as big as the maximum value
of $f$ on $\Delta \cap L$, by Theorem 3.2 we have

volpr"^ {H^ n Q) > ;5^4rf'^e^(") vol(A). 

Since

$$\operatorname{vol}\, pr^{-1}(H^{\pm} \cap Q) = \int_{H^{\pm} \cap Q} \nu(y) \, dy,$$



we conclude that there exist points $y^+ \in H^+$ and $y^- \in H^-$ such that

In other words, for any affine hyperplane $H \subset L^{\perp}$ through $y_0$ on either side of the hyperplane there are points $y^+$, $y^-$ for which inequality (5.2.1) holds. Hence $y_0$ lies in the convex hull of points $y$ for which the inequality holds. The proof of Part (1) follows by the log-concavity of $\nu$.

Let us prove Part (2). Since $a$ is the maximum point of the strictly concave function

$$f(x) = \sum_{i=1}^{d} \ln x_i$$

on $\Delta \cap L$, the gradient of $f$ at $a$ is orthogonal to $L$. Hence $L$ is orthogonal to the vector

$$\left( \frac{1}{a_1}, \ldots, \frac{1}{a_d} \right).$$

If $a \ne c$, let $H \subset \mathcal{A}$ be the affine hyperplane defined by the equation

$$\sum_{i=1}^{d} \frac{x_i}{a_i} = d$$

and if $a = c$ let $H$ be any affine hyperplane containing $L$. In either case $L \subset H$ and the maximum values of $f$ on $\Delta \cap H$ and on $\Delta \cap L$ coincide and are equal to $f(a)$. Therefore, by Theorem 3.2, we have

$$\operatorname{vol}(\Delta \cap H^-) \le d^d e^{f(a)} \operatorname{vol}(\Delta) \tag{5.2.2}$$

for some open halfspace $H^-$ bounded by $H$.
We apply Lemma 4.3. Namely, we let 

Q = |a; e A : ||a; — y||oo<^ for some yeAnLj. 

Then, by Lemma 4.3, 

voi(gnif-) > ^(^J void-fe-i(AnL). 

Since

$$\operatorname{vol}(Q \cap H^-) \le \operatorname{vol}(\Delta \cap H^-),$$

we get the upper bound from (5.2.2). □
Finally, we prove Theorem 1.2.

(5.3) Proof of Theorem 1.2. Let us consider the contracted polytope $N^{-1}\mathcal{P}$ defined by the equations

$$\sum_{j=1}^{n} x_{ij} = \frac{r_i}{N} \quad \text{for } i = 1, \ldots, m, \qquad \sum_{i=1}^{m} x_{ij} = \frac{c_j}{N} \quad \text{for } j = 1, \ldots, n$$

and inequalities

$$x_{ij} \ge 0 \quad \text{for all } i, j.$$

Then $N^{-1}\mathcal{P}$ can be represented as an intersection of the standard simplex in the space of $m \times n$ matrices and an affine subspace of dimension $(m-1)(n-1)$. We are going to use Theorem 3.1. Let $A = (\alpha_{ij})$, $A \in N^{-1}\mathcal{P}$, be the point maximizing the product of the coordinates. Writing the optimality condition for

$$f(X) = \sum_{i,j} \ln x_{ij}$$

on $N^{-1}\mathcal{P}$, we conclude that

$$\frac{1}{\alpha_{ij}} = \lambda_i + \mu_j \quad \text{for all } i, j$$

and some $\lambda_1, \ldots, \lambda_m$ and $\mu_1, \ldots, \mu_n$. Since $\lambda_i + \mu_j > 0$ for all $i, j$, we may assume that $\lambda_i, \mu_j \ge 0$ for all $i, j$. If $\lambda_i > nN/r_i$ for some $i$ then $\alpha_{ij} < r_i/nN$ for all $j$, which is a contradiction, since the $i$-th row of $A$ sums to $r_i/N$. If $\mu_j > mN/c_j$ for some $j$ then $\alpha_{ij} < c_j/mN$ for all $i$, which is a contradiction, since the $j$-th column of $A$ sums to $c_j/N$. Hence $\lambda_i \le nN/r_i$ for $i = 1, \ldots, m$ and $\mu_j \le mN/c_j$ for $j = 1, \ldots, n$, from which

$$\alpha_{ij} \ge \frac{r_i c_j}{N n c_j + N m r_i} = \frac{1}{(nN/r_i) + (mN/c_j)}.$$

The proof now follows by Theorem 3.1 with $d = mn$, $k = m + n - 2$, and

/ N iV^ 
e = I n max \- m max — 

y i=l,... ,m n j = l,--- ,n Cj 

□ 

6. An integral representation for the number of contingency tables
In this section, we recall bounds for $\#(R,C)$ obtained in [Ba07] and [Ba08].

(6.1) Matrix scaling. Our estimates for the number $\#(R,C)$ of contingency tables essentially use the theory of matrix scaling, see [Si64], [MO68], [RS89]. Let us fix non-negative vectors $R = (r_1, \ldots, r_m)$, $C = (c_1, \ldots, c_n)$, such that

$$\sum_{i=1}^{m} r_i = \sum_{j=1}^{n} c_j = N.$$

Then for every $m \times n$ positive matrix $X = (x_{ij})$ there exist a positive $m \times n$ matrix $L = (l_{ij})$ and positive numbers $\lambda_1, \ldots, \lambda_m$ and $\mu_1, \ldots, \mu_n$ such that

$$\sum_{j=1}^{n} l_{ij} = r_i \ \text{ for } i = 1, \ldots, m, \qquad \sum_{i=1}^{m} l_{ij} = c_j \ \text{ for } j = 1, \ldots, n, \qquad \text{and} \qquad x_{ij} = \lambda_i \mu_j l_{ij} \ \text{ for all } i, j. \tag{6.1.1}$$

Moreover, given $X$, the matrix $L$ is unique while the numbers $\lambda_i$ and $\mu_j$ are unique up to a rescaling:

$$\lambda_i \mapsto \lambda_i \tau \ \text{ for } i = 1, \ldots, m, \qquad \mu_j \mapsto \mu_j \tau^{-1} \ \text{ for } j = 1, \ldots, n$$

and some $\tau > 0$.

(6.2) Function $\phi$. This allows us to define a function

$$\phi(X) = \left( \prod_{i=1}^{m} \lambda_i^{r_i} \right) \left( \prod_{j=1}^{n} \mu_j^{c_j} \right),$$

where $\lambda_i$ and $\mu_j$ are numbers such that equations (6.1.1) hold, on positive $m \times n$ matrices $X$. It turns out that $\phi$ is continuous (it is also log-concave but we don't use that), positive homogeneous of degree $N$,

$$\phi(\alpha X) = \alpha^N \phi(X)$$

for $\alpha > 0$ and positive matrix $X$, and monotone

$$\phi(X) \ge \phi(Y)$$

provided $X$ and $Y$ are positive matrices satisfying $x_{ij} \ge y_{ij}$ for all $i, j$, see, for example, [Ba07] and [Ba08].
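The scaling of Section 6.1 and the function $\phi$ can be explored numerically. The following Python sketch uses alternating row and column rescaling (iterative proportional fitting), which is a standard way to approximate the scaling factors but is not claimed to be the method of [Ba07] or [Ba08]; the names `scale` and `phi`, the iteration count, and the sample matrix are illustrative assumptions.

```python
import math

def scale(X, R, C, iters=500):
    """Alternately rescale rows and columns of the positive matrix X.

    Returns (L, lam, mu) with X[i][j] = L[i][j] * lam[i] * mu[j], where L has
    (approximately) row sums R and column sums C.
    """
    m, n = len(R), len(C)
    lam = [1.0] * m
    mu = [1.0] * n
    for _ in range(iters):
        for i in range(m):   # enforce the row sums for the current mu
            lam[i] = sum(X[i][j] / mu[j] for j in range(n)) / R[i]
        for j in range(n):   # enforce the column sums for the current lam
            mu[j] = sum(X[i][j] / lam[i] for i in range(m)) / C[j]
    L = [[X[i][j] / (lam[i] * mu[j]) for j in range(n)] for i in range(m)]
    return L, lam, mu

def phi(X, R, C):
    """phi(X) = prod lam_i^{r_i} * prod mu_j^{c_j}, computed through logarithms."""
    _, lam, mu = scale(X, R, C)
    log_phi = sum(r * math.log(l) for r, l in zip(R, lam))
    log_phi += sum(c * math.log(u) for c, u in zip(C, mu))
    return math.exp(log_phi)

R, C = [2, 1], [1, 2]            # margins with N = 3
X = [[1.0, 2.0], [3.0, 4.0]]
L, lam, mu = scale(X, R, C)
p = phi(X, R, C)
p_doubled = phi([[2 * x for x in row] for row in X], R, C)
```

In this example `p_doubled` should agree with `2**3 * p`, reflecting homogeneity of degree $N = 3$. The value of `phi` does not depend on the normalization of `lam` and `mu`, since the rescaling $\lambda_i \mapsto \lambda_i\tau$, $\mu_j \mapsto \mu_j\tau^{-1}$ changes the product by $\tau^{N-N} = 1$.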

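Alternatively, $\phi(X)$ can be defined by

$$\phi(X) = \min_{\alpha, \beta} \left( \frac{1}{N} \sum_{i,j} x_{ij} \alpha_i \beta_j \right)^{N},$$

where the minimum is taken over all positive $m$-vectors $\alpha = (\alpha_1, \ldots, \alpha_m)$ and positive $n$-vectors $\beta = (\beta_1, \ldots, \beta_n)$ satisfying

$$\prod_{i=1}^{m} \alpha_i^{r_i} = \prod_{j=1}^{n} \beta_j^{c_j} = 1,$$

see also [MO68].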
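A short worked derivation may help to see why the scaling definition of $\phi$ and the minimum formula $\phi(X) = \min \bigl( \frac{1}{N} \sum_{i,j} x_{ij}\alpha_i\beta_j \bigr)^N$ agree. It is only a sketch, assuming the minimum is attained at a critical point; the multipliers $\theta$, $\theta'$ are auxiliary symbols introduced here.

```latex
% Sketch (not taken from the source): the two definitions of \phi agree.
\begin{align*}
&\text{Minimize } S(\alpha,\beta)=\sum_{i,j} x_{ij}\alpha_i\beta_j
  \text{ subject to } \sum_i r_i\ln\alpha_i=\sum_j c_j\ln\beta_j=0.\\
&\text{Lagrange conditions: }\ \sum_j x_{ij}\alpha_i\beta_j=\theta\, r_i,\qquad
  \sum_i x_{ij}\alpha_i\beta_j=\theta' c_j .\\
&\text{Summing over } i \text{ and over } j:\ S=\theta N=\theta' N,
  \text{ so } \theta=\theta'=S/N .\\
&\text{Hence } l_{ij}=x_{ij}\alpha_i\beta_j/\theta
  \text{ has row sums } R \text{ and column sums } C,\\
&\text{and (6.1.1) holds with } \lambda_i=\theta/\alpha_i,\quad \mu_j=1/\beta_j .\\
&\phi(X)=\prod_i\lambda_i^{r_i}\prod_j\mu_j^{c_j}
  =\theta^{N}\Bigl(\prod_i\alpha_i^{r_i}\Bigr)^{-1}\Bigl(\prod_j\beta_j^{c_j}\Bigr)^{-1}
  =\Bigl(\frac{1}{N}\sum_{i,j}x_{ij}\alpha_i\beta_j\Bigr)^{N}.
\end{align*}
```

The last equality uses the constraints $\prod_i \alpha_i^{r_i} = \prod_j \beta_j^{c_j} = 1$ and $\sum_i r_i = N$.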

(6.3) The bounds. Let us identify the space of $m \times n$ matrices with Euclidean space $\mathbb{R}^d$ for $d = mn$, let $\mathcal{A} \subset \mathbb{R}^d$ be the affine hyperplane defined by the equation

$$\sum_{i,j} x_{ij} = 1$$

and let $\Delta \subset \mathcal{A}$ be the standard open simplex defined by the inequalities

$$x_{ij} > 0 \quad \text{for all } i, j$$

with the Lebesgue measure $dX$ induced from the Euclidean structure in $\mathbb{R}^d$. It is proved in [Ba07] and [Ba08] that

(6.3.1) and 

Therefore, we have an approximation up to an $N^{\gamma(m+n)}$ factor for some absolute constant $\gamma > 0$:

$$\#(R,C) \approx e^{N} (N + mn)! \int_{\Delta} \phi(X) \, dX. \tag{6.3.2}$$

In fact, we will be using only a lower bound in (6.3.1). 

For completeness, let us sketch the main ingredients of the proof of (6.3.1). 

Recall that the permanent of an $N \times N$ matrix $A = (a_{ij})$ is defined by the formula

$$\operatorname{per} A = \sum_{\sigma \in S_N} \prod_{i=1}^{N} a_{i\sigma(i)},$$

where the sum is taken over all $N!$ permutations $\sigma$ from the symmetric group $S_N$. For an $m \times n$ matrix $X = (x_{ij})$ let us define the $N \times N$ block matrix $A(X)$ that has $mn$ blocks of sizes $r_i \times c_j$ for $i = 1, \ldots, m$ and $j = 1, \ldots, n$ with the $(i,j)$-th block filled by the copies of $x_{ij}$. A combinatorial computation produces the following expansion

$$\operatorname{per} A(X) = \left( \prod_{i=1}^{m} r_i! \right) \left( \prod_{j=1}^{n} c_j! \right) \sum_{D} \prod_{i,j} \frac{x_{ij}^{d_{ij}}}{d_{ij}!},$$

where the sum is taken over all $m \times n$ non-negative integer matrices $D = (d_{ij})$ with row sums $R$ and column sums $C$. From this expansion we obtain the formula

$$\#(R,C) = \left( \prod_{i=1}^{m} \frac{1}{r_i!} \right) \left( \prod_{j=1}^{n} \frac{1}{c_j!} \right) \int_{\mathbb{R}^d_+} \operatorname{per} A(X) \exp\Bigl\{ -\sum_{i,j} x_{ij} \Bigr\} \, dX,$$

where $\mathbb{R}^d_+$ is the set of $m \times n$ positive matrices $X$, see Theorem 1.1 of [Ba08]. Since $\operatorname{per} A(X)$ is a homogeneous polynomial of degree $N$ in $X$, a standard change of variables results in the formula



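$$\#(R,C) = \frac{(N + mn - 1)!}{\sqrt{mn}} \left( \prod_{i=1}^{m} \frac{1}{r_i!} \right) \left( \prod_{j=1}^{n} \frac{1}{c_j!} \right) \int_{\Delta} \operatorname{per} A(X) \, dX,$$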



cf. Lemma 4.1 of [Ba08]. Given a matrix $X \in \Delta$, let $\lambda_1, \ldots, \lambda_m$ and $\mu_1, \ldots, \mu_n$ be its scaling factors so that (6.1.1) holds. Let $B(X)$ be the matrix obtained by dividing the entries in the $(i,j)$-th block of $A(X)$ by $\lambda_i r_i \mu_j c_j$, so the entries in the $(i,j)$-th block of $B(X)$ are equal to $l_{ij}/r_i c_j$. Hence

$$\operatorname{per} A(X) = \left( \prod_{i=1}^{m} (\lambda_i r_i)^{r_i} \right) \left( \prod_{j=1}^{n} (\mu_j c_j)^{c_j} \right) \operatorname{per} B(X) = \left( \prod_{i=1}^{m} r_i^{r_i} \right) \left( \prod_{j=1}^{n} c_j^{c_j} \right) \phi(X) \operatorname{per} B(X),$$

cf. Section 3.1 of [Ba08]. Now we notice that $B(X)$ is a doubly stochastic matrix, that is, a non-negative matrix with row and column sums equal to 1. The classical estimate for permanents of doubly stochastic matrices conjectured by van der Waerden and proved by Falikman and Egorychev (see [Fa81], [Eg81], and Chapter 12 of [LW01]) asserts that

$$\operatorname{per} B(X) \ge \frac{N!}{N^N}$$

and hence the lower bound in (6.3.1) follows. The upper bound in (6.3.1) follows from the inequality for permanents conjectured by Minc and proven by Bregman (see [Br73] and Chapter 11 of [LW01]), which results in

$$\operatorname{per} B(X) \le \min \left\{ \prod_{i=1}^{m} \frac{r_i!}{r_i^{r_i}}, \ \prod_{j=1}^{n} \frac{c_j!}{c_j^{c_j}} \right\}$$

since the entries in the $(i,j)$-th block of $B(X)$ do not exceed $\min\{1/r_i, 1/c_j\}$, see Section 5 of [Ba08] for details.
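The combinatorial ingredients of this sketch can be checked by brute force on a tiny example. The Python code below is only an illustration: the helper names (`permanent`, `blow_up`, `tables`), the margins $R = (2,1)$, $C = (1,2)$, and the sample matrices are assumptions chosen here, and the permanent is computed by direct summation over permutations, which is feasible only for very small $N$.

```python
from fractions import Fraction
from itertools import permutations, product
from math import factorial

def permanent(A):
    """Permanent by summing over all permutations (tiny matrices only)."""
    n = len(A)
    total = 0
    for sigma in permutations(range(n)):
        term = 1
        for i in range(n):
            term *= A[i][sigma[i]]
        total += term
    return total

def blow_up(X, R, C):
    """N x N block matrix whose (i, j)-th block has size R[i] x C[j] and is filled with X[i][j]."""
    rows = []
    for i, r in enumerate(R):
        row = [X[i][j] for j, c in enumerate(C) for _ in range(c)]
        rows.extend(list(row) for _ in range(r))
    return rows

def tables(R, C):
    """Brute-force list of non-negative integer matrices with row sums R and column sums C."""
    m, n = len(R), len(C)
    result = []
    for flat in product(range(max(R) + 1), repeat=m * n):
        D = [flat[i * n:(i + 1) * n] for i in range(m)]
        rows_ok = all(sum(D[i]) == R[i] for i in range(m))
        cols_ok = all(sum(D[i][j] for i in range(m)) == C[j] for j in range(n))
        if rows_ok and cols_ok:
            result.append(D)
    return result

def prod_of(values, f):
    out = 1
    for v in values:
        out *= f(v)
    return out

R, C = [2, 1], [1, 2]
N = sum(R)

# 1. Expansion of per A(X) as a sum over contingency tables D.
X = [[2, 3], [5, 7]]
lhs = permanent(blow_up(X, R, C))
rhs = Fraction(0)
for D in tables(R, C):
    term = Fraction(1)
    for i in range(len(R)):
        for j in range(len(C)):
            term *= Fraction(X[i][j] ** D[i][j], factorial(D[i][j]))
    rhs += term
rhs *= prod_of(R, factorial) * prod_of(C, factorial)

# 2. A positive matrix with margins (R, C) plays the role of L; B is then doubly stochastic.
L = [[Fraction(1, 2), Fraction(3, 2)], [Fraction(1, 2), Fraction(1, 2)]]
B = blow_up([[L[i][j] / (R[i] * C[j]) for j in range(2)] for i in range(2)], R, C)
per_B = permanent(B)
lower = Fraction(factorial(N), N ** N)                       # van der Waerden bound
upper = min(Fraction(prod_of(R, factorial), prod_of(R, lambda r: r ** r)),
            Fraction(prod_of(C, factorial), prod_of(C, lambda c: c ** c)))
```

For these margins there are two contingency tables, both sides of the expansion equal 258, and the permanent of `B` lies between the two bounds quoted above.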

(6.4) Slicing the simplex. The crucial observation which makes the integral

$$\int_{\Delta} \phi(X) \, dX$$

amenable to analysis is that the simplex $\Delta$ can be sliced by affine subspaces of codimension $m + n - 1$ into sections on which the function $\phi$ remains constant.

Let us choose some positive $\lambda_1, \ldots, \lambda_m$ and $\mu_1, \ldots, \mu_n$ and let us consider the affine subspace $L \subset \mathbb{R}^d$ of $m \times n$ matrices $X = (x_{ij})$ satisfying the equations

$$\sum_{j=1}^{n} \frac{x_{ij}}{\lambda_i \mu_j} = r_i \ \text{ for } i = 1, \ldots, m, \qquad \sum_{i=1}^{m} \frac{x_{ij}}{\lambda_i \mu_j} = c_j \ \text{ for } j = 1, \ldots, n. \tag{6.4.1}$$

Clearly,

$$\phi(X) = \left( \prod_{i=1}^{m} \lambda_i^{r_i} \right) \left( \prod_{j=1}^{n} \mu_j^{c_j} \right) \quad \text{for all } X \in \Delta \cap L. \tag{6.4.2}$$

Moreover, $\dim L = (m-1)(n-1)$.

(6.5) Modification for weighted tables. Similar identities and inequalities hold for weighted tables. For a positive matrix $W = (w_{ij})$ of weights, we define the function

$$\phi_{R,C;W}(X) = \phi_{R,C}(Y) \quad \text{where } y_{ij} = w_{ij} x_{ij} \text{ for all } i, j$$

and $\phi_{R,C}$ is the unweighted function defined in Section 6.2. Then

Nl{N + mn- 1)1 



T{R,C;W) > 



X 



V 



(6.5.1) and 

{N + mn- 1)! 



T{R,C : W) < 



I i=l j=l I 



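see [Ba07], [Ba08], and the proof sketch in Section 6.3.

Let us choose some positive $\lambda_1, \ldots, \lambda_m$ and $\mu_1, \ldots, \mu_n$ and let us consider the subspace $L \subset \mathbb{R}^d$ of $m \times n$ matrices $X = (x_{ij})$ satisfying the equations

$$\sum_{j=1}^{n} \frac{w_{ij} x_{ij}}{\lambda_i \mu_j} = r_i \ \text{ for } i = 1, \ldots, m, \qquad \sum_{i=1}^{m} \frac{w_{ij} x_{ij}}{\lambda_i \mu_j} = c_j \ \text{ for } j = 1, \ldots, n. \tag{6.5.2}$$

Clearly,

$$\phi_{R,C;W}(X) = \left( \prod_{i=1}^{m} \lambda_i^{r_i} \right) \left( \prod_{j=1}^{n} \mu_j^{c_j} \right) \quad \text{for all } X \in \Delta \cap L. \tag{6.5.3}$$
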
Moreover, $\dim L = (m-1)(n-1)$.

7. Proofs of Theorems 1.1 and 1.3 and Lemma 1.4
We prove Lemma 1.4 first.
(7.1) Proof of Lemma 1.4. It is straightforward to check that the function

$$g(x; w) = (x+1)\ln(x+1) - x \ln x + x \ln w \quad \text{for } x \ge 0$$

is strictly concave for $x > 0$. Therefore, the maximum of $g(X; W)$ on $\mathcal{P}(R,C)$ is attained at a single point $Z = (z_{ij})$. Let us show that necessarily $z_{ij} > 0$ for all $i, j$.

Since

$$g'(x; w) = \ln \frac{x+1}{x} + \ln w,$$

the derivative of $g(x; w)$ at $x > 0$ is finite and the right derivative at $x = 0$ is $+\infty$. Let $Y \in \mathcal{P}(R,C)$ be a matrix with positive entries, for example, $Y = (y_{ij})$ where $y_{ij} = r_i c_j / N$. If $z_{ij} = 0$ for some $i, j$ then

$$g\bigl((1-\varepsilon) Z + \varepsilon Y; W\bigr) > g(Z; W)$$

for some sufficiently small $\varepsilon > 0$, which is a contradiction.

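Thus $z_{ij} > 0$ for all $i, j$ and hence $Z$ lies in the relative interior of $\mathcal{P}(R,C)$. Therefore the gradient of $g(X; W)$ at $X = Z$ is orthogonal to the affine span of $\mathcal{P}(R,C)$, that is,

$$\ln \left( \frac{z_{ij} + 1}{z_{ij}} \right) + \ln w_{ij} = \lambda_i + \mu_j \quad \text{for all } i, j$$

and some $\lambda_1, \ldots, \lambda_m, \mu_1, \ldots, \mu_n$.
Let

$$\xi_i = e^{-\lambda_i} > 0 \ \text{ for } i = 1, \ldots, m \quad \text{and} \quad \eta_j = e^{-\mu_j} > 0 \ \text{ for } j = 1, \ldots, n.$$

Then

$$w_{ij} \xi_i \eta_j = \frac{z_{ij}}{z_{ij} + 1}$$

and

$$z_{ij} = \frac{w_{ij} \xi_i \eta_j}{1 - w_{ij} \xi_i \eta_j} \quad \text{for all } i, j. \tag{7.1.1}$$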



In particular,

$$\sum_{j=1}^{n} \frac{w_{ij} \xi_i \eta_j}{1 - w_{ij} \xi_i \eta_j} = r_i \ \text{ for } i = 1, \ldots, m \quad \text{and} \quad \sum_{i=1}^{m} \frac{w_{ij} \xi_i \eta_j}{1 - w_{ij} \xi_i \eta_j} = c_j \ \text{ for } j = 1, \ldots, n. \tag{7.1.2}$$

Equations (7.1.2) are equivalent to the statement that the point $t^* = (\lambda_1, \ldots, \lambda_m)$ and $s^* = (\mu_1, \ldots, \mu_n)$ is a critical point of the function

$$\varphi(t, s) = \sum_{i=1}^{m} r_i t_i + \sum_{j=1}^{n} c_j s_j - \sum_{i,j} \ln \left( 1 - w_{ij} e^{-t_i - s_j} \right).$$

Since $\varphi$ is convex, the point $(t^*, s^*)$ is a minimum point of $\varphi$ and hence the point $x^* = (\xi_1, \ldots, \xi_m)$ and $y^* = (\eta_1, \ldots, \eta_n)$ is a point where the infimum of $F(x, y; W)$ is attained in the region $x_1, \ldots, x_m > 0$, $y_1, \ldots, y_n > 0$, and $w_{ij} x_i y_j < 1$ for all $i, j$.
Using (7.1.1) and (7.1.2), we conclude that

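$$\begin{aligned}
g(Z; W) &= \sum_{i,j} (z_{ij} + 1) \ln (z_{ij} + 1) - \sum_{i,j} z_{ij} (\ln z_{ij} - \ln w_{ij}) \\
&= -\sum_{i,j} \frac{\ln(1 - w_{ij} \xi_i \eta_j)}{1 - w_{ij} \xi_i \eta_j} - \sum_{i,j} \frac{w_{ij} \xi_i \eta_j}{1 - w_{ij} \xi_i \eta_j} \ln \left( \frac{\xi_i \eta_j}{1 - w_{ij} \xi_i \eta_j} \right) \\
&= -\sum_{i=1}^{m} \ln \xi_i \sum_{j=1}^{n} \frac{w_{ij} \xi_i \eta_j}{1 - w_{ij} \xi_i \eta_j} - \sum_{j=1}^{n} \ln \eta_j \sum_{i=1}^{m} \frac{w_{ij} \xi_i \eta_j}{1 - w_{ij} \xi_i \eta_j} - \sum_{i,j} \ln(1 - w_{ij} \xi_i \eta_j) \\
&= -\sum_{i=1}^{m} r_i \ln \xi_i - \sum_{j=1}^{n} c_j \ln \eta_j - \sum_{i,j} \ln(1 - w_{ij} \xi_i \eta_j) = \ln F(x^*, y^*; W),
\end{aligned}$$

as claimed.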

We observe that the value of $F(x, y; W)$ does not change if we scale $x_i \mapsto x_i \tau$, $y_j \mapsto y_j \tau^{-1}$ for $\tau > 0$. In the case of $w_{ij} = 1$ for all $i, j$ we have $\xi_i \eta_j < 1$ for all $i, j$ and hence by choosing an appropriate $\tau$ we can enforce $0 < \xi_i, \eta_j < 1$ for all $i, j$.

□
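The statement of Lemma 1.4 can be checked numerically on a small instance. The Python sketch below solves equations (7.1.2) by alternately solving the row equations for $\xi$ and the column equations for $\eta$ with bisection; this particular solver, the function names, and the example with $R = (2,1)$, $C = (1,2)$ and all weights equal to 1 are illustrative assumptions, not the procedure of the paper.

```python
import math

def solve_increasing(h, target, upper, steps=100):
    """Bisection for h(v) = target on (0, upper), where h increases from 0 to +infinity."""
    lo, hi = 0.0, upper
    for _ in range(steps):
        mid = 0.5 * (lo + hi)
        if h(mid) < target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def ratio_sum(coeffs, v):
    """Sum of u / (1 - u) over u = coeff * v; +infinity once some u reaches 1."""
    total = 0.0
    for coeff in coeffs:
        u = coeff * v
        if u >= 1.0:
            return math.inf
        total += u / (1.0 - u)
    return total

def critical_point(R, C, W, sweeps=60):
    """Alternately solve the row equations and the column equations of (7.1.2)."""
    m, n = len(R), len(C)
    xi, eta = [1.0] * m, [1.0] * n
    for _ in range(sweeps):
        for i in range(m):
            coeffs = [W[i][j] * eta[j] for j in range(n)]
            xi[i] = solve_increasing(lambda v: ratio_sum(coeffs, v), R[i], 1.0 / max(coeffs))
        for j in range(n):
            coeffs = [W[i][j] * xi[i] for i in range(m)]
            eta[j] = solve_increasing(lambda v: ratio_sum(coeffs, v), C[j], 1.0 / max(coeffs))
    return xi, eta

def g(x, w):
    """g(x; w) = (x + 1) ln(x + 1) - x ln x + x ln w for x > 0."""
    return (x + 1) * math.log(x + 1) - x * math.log(x) + x * math.log(w)

R, C = [2, 1], [1, 2]
W = [[1.0, 1.0], [1.0, 1.0]]
xi, eta = critical_point(R, C, W)
U = [[W[i][j] * xi[i] * eta[j] for j in range(2)] for i in range(2)]
Z = [[u / (1 - u) for u in row] for row in U]                    # formula (7.1.1)
g_Z = sum(g(Z[i][j], W[i][j]) for i in range(2) for j in range(2))
ln_F = (-sum(r * math.log(a) for r, a in zip(R, xi))
        - sum(c * math.log(b) for c, b in zip(C, eta))
        - sum(math.log(1 - u) for row in U for u in row))
```

For this instance the matrix `Z` should have row sums $R$ and column sums $C$, and `g_Z` should agree with `ln_F` up to rounding, in line with the identity $g(Z; W) = \ln F(x^*, y^*; W)$. A direct calculation for these margins gives $z_{11} = (1 + \sqrt{17})/8$, which offers an independent check of the solver.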

We consider the space $\mathbb{R}^d$ for $d = mn$ of $m \times n$ real matrices, the affine hyperplane $\mathcal{A} \subset \mathbb{R}^d$ defined by the equation $\sum_{i,j} x_{ij} = 1$ and the standard open simplex $\Delta \subset \mathcal{A}$ defined by the inequalities $x_{ij} > 0$ for all $i, j$. Let $\phi = \phi_{R,C;W}$ be the function defined in Sections 6.5 and 6.2.

We start with a technical lemma, which is a straightforward modification of Lemma 4.3.

(7.2) Lemma. Let $L \subset \mathcal{A}$ be an affine subspace, $\dim L = d - k - 1$ for $k > 1$. Suppose that there is a point $A = (\alpha_{ij})$, $A \in \Delta \cap L$, such that

Suppose further that the value of the function $\phi = \phi_{R,C;W}$ on $\Delta \cap L$ is constant and equal to $\tau$. Then

for some absolute constant $\gamma > 0$ (one can choose $\gamma \approx 0.14$).
Proof. Let 

Qo = ^^A + : X e A n l| . 

As in the proof of Lemma 4.3, we have 

vold-A;-i(Qo) > - vold_fe-i(A nL) 
e 



and for any X & Qq, X = (xij), we have 
Let us define Q by 



Xij > for all i,j. 



Q^^XeA: \\X-Y\\oo < ^2(^ + 1) for some Y e Qo 



Then, as in Lemma 4.3, we have 



1 / 2e ^ ^ 



We note that for every X E Q there is a F G A Pi L such that 



I 1 I Vij for all i, j. 



Since $\phi$ is monotone and homogeneous of degree $N$ (see Section 6.2), we have



1 \^ 1 



Since 



(f){x) > ( 1 - ]yqrY j ^ ^ X eQ. 

I (j){X) dX> I (j){X) dX, 

J A Jo 



'Q 

the proof follows. □

(7.3) Proof of Theorem 1.1. The upper bound follows immediately from the standard generating function expression:

$$\prod_{i,j} \frac{1}{1 - x_i y_j} = \sum_{R,C} \#(R,C) \, x^R y^C, \quad \text{where } x^R = \prod_{i=1}^{m} x_i^{r_i} \text{ and } y^C = \prod_{j=1}^{n} y_j^{c_j}$$

and the sum is taken over all pairs of non-negative integer $m$-vectors $R = (r_1, \ldots, r_m)$ and $n$-vectors $C = (c_1, \ldots, c_n)$ such that $r_1 + \ldots + r_m = c_1 + \ldots + c_n$.
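Since every term of the series is non-negative for $0 < x_i, y_j < 1$, a single term $\#(R,C)\, x^R y^C$ cannot exceed the whole product, so $\#(R,C) \le F(x,y)$ at every point of the open cube. The Python sketch below illustrates this on a small example with an exact brute-force count; the function names, the margins, and the sample grid are assumptions chosen for illustration.

```python
from itertools import product

def count_tables(R, C):
    """Number of non-negative integer matrices with row sums R and column sums C,
    obtained by choosing one row at a time (exponential time, tiny inputs only)."""
    if not R:
        return 1 if all(c == 0 for c in C) else 0
    total = 0
    for row in product(*(range(min(R[0], c) + 1) for c in C)):
        if sum(row) == R[0]:
            total += count_tables(R[1:], [c - d for c, d in zip(C, row)])
    return total

def F(x, y, R, C):
    """F(x, y) = prod x_i^{-r_i} * prod y_j^{-c_j} * prod 1 / (1 - x_i y_j)."""
    value = 1.0
    for xi, r in zip(x, R):
        value /= xi ** r
    for yj, c in zip(y, C):
        value /= yj ** c
    for xi in x:
        for yj in y:
            value /= 1.0 - xi * yj
    return value

R, C = [2, 2, 1], [3, 1, 1]
exact = count_tables(R, C)

# Evaluate F on a coarse grid inside the open cube; every value bounds the count from above.
grid = [0.2, 0.5, 0.8]
smallest = min(F(x, y, R, C)
               for x in product(grid, repeat=len(R))
               for y in product(grid, repeat=len(C)))
```

The grid minimum `smallest` is only a crude stand-in for the true minimum of $F$, but it is already an upper bound for `exact`.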
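Let us prove the lower bound. By Lemma 1.4 the minimum of

$$F(x, y) = \left( \prod_{i=1}^{m} x_i^{-r_i} \right) \left( \prod_{j=1}^{n} y_j^{-c_j} \right) \prod_{i,j} \frac{1}{1 - x_i y_j}$$

on the open cube $0 < x_i, y_j < 1$ for all $i, j$ is attained at a certain point

$$x^* = (\xi_1, \ldots, \xi_m) \quad \text{and} \quad y^* = (\eta_1, \ldots, \eta_n),$$

which, moreover, satisfies

$$\sum_{j=1}^{n} \frac{\xi_i \eta_j}{1 - \xi_i \eta_j} = r_i \ \text{ for } i = 1, \ldots, m, \qquad \sum_{i=1}^{m} \frac{\xi_i \eta_j}{1 - \xi_i \eta_j} = c_j \ \text{ for } j = 1, \ldots, n. \tag{7.3.1}$$

Equations (7.3.1) can also be obtained by setting the gradient of $\ln F$ to 0.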

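In the space of $m \times n$ matrices $\mathbb{R}^d$ with $d = mn$, let us consider the standard simplex $\Delta$ and the point $A = (\alpha_{ij})$ defined by

$$\alpha_{ij} = \frac{1}{(N + mn)(1 - \xi_i \eta_j)} \quad \text{for all } i, j.$$

By (7.3.1), we have

$$\sum_{i,j} \frac{1}{1 - \xi_i \eta_j} = \sum_{i,j} 1 + \sum_{i,j} \frac{\xi_i \eta_j}{1 - \xi_i \eta_j} = mn + N,$$

so $A$ lies in $\Delta$. Let

$$\lambda_i = \frac{1}{\xi_i \sqrt{N + mn}} \ \text{ for } i = 1, \ldots, m \quad \text{and} \quad \mu_j = \frac{1}{\eta_j \sqrt{N + mn}} \ \text{ for } j = 1, \ldots, n.$$

Let us consider the affine subspace $L \subset \mathbb{R}^d$ defined by the system of equations

$$\sum_{j=1}^{n} \frac{x_{ij}}{\lambda_i \mu_j} = r_i \ \text{ for } i = 1, \ldots, m, \qquad \sum_{i=1}^{m} \frac{x_{ij}}{\lambda_i \mu_j} = c_j \ \text{ for } j = 1, \ldots, n.$$

Hence $\dim L = (m-1)(n-1)$ and $A \in L$ by (7.3.1).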

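By (6.4.2), the density $\phi = \phi_{R,C}$ is constant on $L$ and equal to

$$\left( \prod_{i=1}^{m} \lambda_i^{r_i} \right) \left( \prod_{j=1}^{n} \mu_j^{c_j} \right) = (N + mn)^{-N} \left( \prod_{i=1}^{m} \xi_i^{-r_i} \right) \left( \prod_{j=1}^{n} \eta_j^{-c_j} \right).$$

By Part (1) of Theorem 3.1, the volume of the section $\Delta \cap L$ within a factor of $(N + mn)^{O(m+n)}$ is at least

$$\frac{e^{mn}}{(N + mn)^{mn}} \prod_{i,j} \frac{1}{1 - \xi_i \eta_j}.$$

More precisely, for $k = d - 1 - \dim(\Delta \cap L)$ we have $k = m + n - 1$ or $k = m + n - 2$
and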

r(/e/2 + l) (mn)"^" 1 f 1 

vOd_i_fc( n ) > 2e37r'=/2v& (mn)! {N + mn)^^ \\j~l-^ir]j 

Choosing $\varepsilon = 1/(N + mn)$ in Lemma 7.2, we estimate the integral

$$\int_{\Delta} \phi(X) \, dX$$



within a factor of from below by 



(iV + mn)^+"^^ V / l-LJ-''-? 

^ ' \i=l / \j = l 

More precisely, 

r(i^) 



ni-e...) 



/ 0(^) 

J A 



dX > 



26%^^ ^ ^ (mn)2(iV + l)(iV + mn) 



m+n— 1 



X 



X 



(mn)! (iV + mn)^+^'^ 



n 

-c 

U=l / \j=l 
provided $m + n > 10$. Hence by (6.3.2) the number $\#(R,C)$ is estimated from below within a factor of $(N + mn)^{O(m+n)}$ by



e^{N + mn)l [ 4>{X) dX 

J A 



e^^+^{N + mny. 



X 



Ki=l / \j=l J \ ij 

where "$\approx$" stands for an approximation within an $N^{O(m+n)}$ factor.
More precisely, by (6.3.1)

-p / m+n \ / o \ rn+n—l 



2e^7r''^mn{N + mn) \{mn)^iN + 1){N + mn) 



n c 



X 



n W TT i mN + mn)\imnr- 
II rA i \ He,! I NN{N + mn)^+^^{mn)r^ 



provided m + n > 10. □ 
The proof of Theorem 1.3 is a straightforward modification of the proof of Theorem 1.1.

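(7.4) Proof of Theorem 1.3. The upper bound follows from the generating function expression

$$\prod_{i,j} \frac{1}{1 - w_{ij} x_i y_j} = \sum_{R,C} T(R, C; W) \, x^R y^C.$$

Let us prove the lower bound. Since $T(R, C; W)$ is a polynomial in $W$, without loss of generality we assume that $W$ is a strictly positive matrix. Let

$$x^* = (\xi_1, \ldots, \xi_m) \quad \text{and} \quad y^* = (\eta_1, \ldots, \eta_n)$$

be the minimum point of

$$F(x, y; W) = \left( \prod_{i=1}^{m} x_i^{-r_i} \right) \left( \prod_{j=1}^{n} y_j^{-c_j} \right) \prod_{i,j} \frac{1}{1 - w_{ij} x_i y_j},$$

see Lemma 1.4. Then

$$\sum_{j=1}^{n} \frac{w_{ij} \xi_i \eta_j}{1 - w_{ij} \xi_i \eta_j} = r_i \ \text{ for } i = 1, \ldots, m, \qquad \sum_{i=1}^{m} \frac{w_{ij} \xi_i \eta_j}{1 - w_{ij} \xi_i \eta_j} = c_j \ \text{ for } j = 1, \ldots, n. \tag{7.4.1}$$

In the space $\mathbb{R}^d$ of matrices, let us consider the standard simplex $\Delta$ and the matrix $A = (\alpha_{ij})$,

$$\alpha_{ij} = \frac{1}{(N + mn)(1 - w_{ij} \xi_i \eta_j)} \quad \text{for all } i, j.$$

As in the proof of Theorem 1.1, we check from (7.4.1) that indeed $A \in \Delta$. Let

$$\lambda_i = \frac{1}{\xi_i \sqrt{N + mn}} \ \text{ for } i = 1, \ldots, m \quad \text{and} \quad \mu_j = \frac{1}{\eta_j \sqrt{N + mn}} \ \text{ for } j = 1, \ldots, n.$$

Let us consider the affine space $L \subset \mathbb{R}^d$ defined by the equations

$$\sum_{j=1}^{n} \frac{w_{ij} x_{ij}}{\lambda_i \mu_j} = r_i \ \text{ for } i = 1, \ldots, m, \qquad \sum_{i=1}^{m} \frac{w_{ij} x_{ij}}{\lambda_i \mu_j} = c_j \ \text{ for } j = 1, \ldots, n.$$

Then $A \in L$, the value of $\phi_{R,C;W}$ on $\Delta \cap L$ is constant and equal to

$$(N + mn)^{-N} \left( \prod_{i=1}^{m} \xi_i^{-r_i} \right) \left( \prod_{j=1}^{n} \eta_j^{-c_j} \right),$$

see (6.5.2)-(6.5.3). Next, we use the lower bound in (6.5.1) and the proof proceeds as for Theorem 1.1. □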